Polypeptides Having Cellulase Activity and Polynucleotides Encoding Same

ABSTRACT

The present invention relates to isolated polypeptides having cellulase activity. The invention also relates to nucleic acid constructs, vectors, and host cells comprising the polynucleotides as well as methods of producing and using the polypeptides.

REFERENCE TO A SEQUENCE LISTING

This application contains a Sequence Listing in computer readable form,which is incorporated herein by reference.

BACKGROUND OF THE INVENTION

1. Field of the Invention

The present invention relates to polypeptides having cellulase activityand polynucleotides encoding the polypeptides. The invention alsorelates to nucleic acid constructs, vectors, and host cells comprisingthe polynucleotides as well as methods of producing and using thepolypeptides.

2. Description of the Related Art

Terebella lapidaria (Linaeus, 1767) is a polychaete living only inmarine environments. Specimens are typically 3-9 cm in length and 4-5 mmin diameter.

T. lapidaria are tube builders that cement sand particles together intoa tube habitat. They scavenge organic material, algae and or smallanimals with their numerous grooved palps. Terebella lapidaria is amarine specie with a wide ecologic distribution, that can be found inthe intertidal and subtidal in up to 30-40 m depth, both in mobilesubstrates such as sand, muddy sand and mud but can also be found onrigid substrates such as rock strata.

Like many sand and detritus eating polychaetes, Terebella has a powerfuldigestive biosurfactant system and enzymes produced by the digestivesystem have obviously evolved to work efficiently in surfactantenvironments at or around neutral pH.

The present invention provides polypeptides having cellulase activityand polynucleotides encoding the polypeptides.

SUMMARY OF THE INVENTION

The present invention relates to isolated polypeptides having cellulaseactivity selected from the group consisting of:

(a) a polypeptide having at least 60% sequence identity to the maturepolypeptide of SEQ ID NO:2;

(b) a polypeptide encoded by a polynucleotide that hybridizes undermedium stringency conditions with (i) the mature polypeptide codingsequence of SEQ ID NO:1, (ii) the genomic DNA sequence thereof, or (iii)the full-length complement of (i) or (ii);

(c) a polypeptide encoded by a polynucleotide having at least 60%sequence identity to the mature polypeptide coding sequence of SEQ IDNO:1 or the genomic DNA sequence thereof;

(d) a variant of the mature polypeptide of SEQ ID NO:2 comprising asubstitution, deletion, and/or insertion at one or more (e.g., several)positions; and

(e) a fragment of the polypeptide of (a), (b), (c) or (d) that hascellulase activity.

The present invention also relates to isolated polypeptides comprising aGH9 catalytic domain selected from the group consisting of:

(a) a catalytic domain having at least 60% sequence identity to aminoacids 1 to 428 of SEQ ID NO:2;

(b) a catalytic domain encoded by a polynucleotide that hybridizes undermedium stringency conditions with (i) nucleotides 12 to 1295 of SEQ IDNO:1, (ii) the genomic DNA sequence thereof, or (iii) the full-lengthcomplement of (i) or (ii);

(c) a catalytic domain encoded by a polynucleotide having at least 60%sequence identity to nucleotides 12 to 1295 of SEQ ID NO:1 or thegenomic DNA sequence thereof;

(d) a variant of amino acids 1 to 428 of SEQ ID NO:2 comprising asubstitution, deletion, and/or insertion at one or more (e.g., several)positions; and

(e) a fragment of the GH9 catalytic domain of (a), (b), (c) or (d) thathas cellulase activity.

The present invention also relates to isolated polynucleotides encodingthe polypeptides of the present invention; nucleic acid constructs;recombinant expression vectors; recombinant host cells comprising thepolynucleotides; and methods of producing the polypeptides.

The present invention also relates to compositions comprising thecellulose of the invention, in particular detergent compositions, andcompositions for textile treatment and for pulp and paper treatment.

BRIEF DESCRIPTION OF THE FIGURES

FIG. 1 shows the pH optimum for the cellulase, determined using PASOwith a reducing sugar assay. The enzyme displayed a broad pH optimum andthe optimal pH for this enzyme was 6. The activity was above 80% maxactivity from pH 6 to 9. Diamonds, PASO+Enzyme; Triangles, PASO−Enzyme.

FIG. 2 shows the temperature profile for Terebella lapidaria determinedon 0.2% AZCL-HE-Cellulose in 0.1 M Na-phosphate, pH 7.

FIG. 3 shows the wash performance of the Terebella cellulase, columnmarked U33M5, compared with two commercial products, Endolase andWhitezyme.

DEFINITIONS

Cellulase: The term “cellulase” means an endo β-1,4-glucanase activity(EC 3.2.1.4.) that catalyzes the hydrolyses of a β-1,4-bindingconnecting two glucosyl residues in a β-1,4-glucan. For purposes of thepresent invention, cellulase activity is determined according to theprocedure described in the Examples.

The polypeptides of the present invention have at least 20%, e.g., atleast 40%, at least 50%, at least 60%, at least 70%, at least 80%, atleast 90%, at least 95%, or at least 100% of the cellulase activity ofthe mature polypeptide of SEQ ID NO:2.

Endoglucanase: The term “endoglucanase” means anendo-1,4-(1,3;1,4)-beta-D-glucan 4-glucanohydrolase (E.C. 3.2.1.4),which catalyses endohydrolysis of 1,4-beta-D-glycosidic linkages incellulose, cellulose derivatives (such as carboxymethyl cellulose andhydroxyethyl cellulose), lichenin, beta-1,4 bonds in mixed beta-1,3glucans such as cereal beta-D-glucans or xyloglucans, and other plantmaterial containing cellulosic components. Endoglucanase activity can bedetermined by measuring reduction in substrate viscosity or increase inreducing ends determined by a reducing sugar assay (Zhang et al., 2006,Biotechnology Advances 24: 452-481). For purposes of the presentinvention, endoglucanase activity is determined using carboxymethylcellulose (CMC) as substrate according to the procedure of Ghose, 1987,Pure and Appl. Chem. 59: 257-268, at pH 5, 40° C.

Allelic variant: The term “allelic variant” means any of two or morealternative forms of a gene occupying the same chromosomal locus.Allelic variation arises naturally through mutation, and may result inpolymorphism within populations. Gene mutations can be silent (no changein the encoded polypeptide) or may encode polypeptides having alteredamino acid sequences. An allelic variant of a polypeptide is apolypeptide encoded by an allelic variant of a gene.

Binding domain: The term “cellulose binding domain” means the region ofan enzyme that mediates binding of the enzyme to amorphous regions of acellulose substrate. The cellulose binding domain (CBD) is typicallyfound either at the N-terminal or at the C-terminal extremity of anendo-glucanase.

Catalytic domain: The term “catalytic domain” means the region of anenzyme containing the catalytic machinery of the enzyme.

cDNA: The term “cDNA” means a DNA molecule that can be prepared byreverse transcription from a mature, spliced, mRNA molecule obtainedfrom a eukaryotic or prokaryotic cell. cDNA lacks intron sequences thatmay be present in the corresponding genomic DNA. The initial, primaryRNA transcript is a precursor to mRNA that is processed through a seriesof steps, including splicing, before appearing as mature spliced mRNA.

Coding sequence: The term “coding sequence” means a polynucleotide,which directly specifies the amino acid sequence of a polypeptide. Theboundaries of the coding sequence are generally determined by an openreading frame, which begins with a start codon such as ATG, GTG, or TTGand ends with a stop codon such as TAA, TAG, or TGA. The coding sequencemay be a genomic DNA, cDNA, synthetic DNA, or a combination thereof.

Control sequences: The term “control sequences” means nucleic acidsequences necessary for expression of a polynucleotide encoding a maturepolypeptide of the present invention. Each control sequence may benative (i.e., from the same gene) or foreign (i.e., from a differentgene) to the polynucleotide encoding the polypeptide or native orforeign to each other. Such control sequences include, but are notlimited to, a leader, polyadenylation sequence, propeptide sequence,promoter, signal peptide sequence, and transcription terminator. At aminimum, the control sequences include a promoter, and transcriptionaland translational stop signals. The control sequences may be providedwith linkers for the purpose of introducing specific restriction sitesfacilitating ligation of the control sequences with the coding region ofthe polynucleotide encoding a polypeptide.

Expression: The term “expression” includes any step involved in theproduction of a polypeptide including, but not limited to,transcription, post-transcriptional modification, translation,post-translational modification, and secretion.

Expression vector: The term “expression vector” means a linear orcircular DNA molecule that comprises a polynucleotide encoding apolypeptide and is operably linked to control sequences that provide forits expression.

Fragment: The term “fragment” means a polypeptide or a catalytic domainhaving one or more (e.g., several) amino acids deleted from the aminoand/or carboxyl terminus of a mature polypeptide or domain; wherein thefragment has cellulase activity. In one aspect, a fragment contains atleast 300 amino acid residues such as the fragment containing aminoacids 3-428 of SEQ ID NO: 2.

High stringency conditions: The term “high stringency conditions” meansfor probes of at least 100 nucleotides in length, prehybridization andhybridization at 42° C. in 5×SSPE, 0.3% SDS, 200 micrograms/ml shearedand denatured salmon sperm DNA, and 50% formamide, following standardSouthern blotting procedures for 12 to 24 hours. The carrier material isfinally washed three times each for 15 minutes using 2×SSC, 0.2% SDS at65° C.

Host cell: The term “host cell” means any cell type that is susceptibleto transformation, transfection, transduction, or the like with anucleic acid construct or expression vector comprising a polynucleotideof the present invention. The term “host cell” encompasses any progenyof a parent cell that is not identical to the parent cell due tomutations that occur during replication.

Isolated: The term “isolated” means a substance in a form or environmentthat does not occur in nature. Non-limiting examples of isolatedsubstances include (1) any non-naturally occurring substance, (2) anysubstance including, but not limited to, any enzyme, variant, nucleicacid, protein, peptide or cofactor, that is at least partially removedfrom one or more or all of the naturally occurring constituents withwhich it is associated in nature; (3) any substance modified by the handof man relative to that substance found in nature; or (4) any substancemodified by increasing the amount of the substance relative to othercomponents with which it is naturally associated (e.g., multiple copiesof a gene encoding the substance; use of a stronger promoter than thepromoter naturally associated with the gene encoding the substance). Anisolated substance may be present in a fermentation broth sample.

Low stringency conditions: The term “low stringency conditions” meansfor probes of at least 100 nucleotides in length, prehybridization andhybridization at 42° C. in 5×SSPE, 0.3% SDS, 200 micrograms/ml shearedand denatured salmon sperm DNA, and 25% formamide, following standardSouthern blotting procedures for 12 to 24 hours. The carrier material isfinally washed three times each for 15 minutes using 2×SSC, 0.2% SDS at50° C.

Mature polypeptide: The term “mature polypeptide” means a polypeptide inits final form following translation and any post-translationalmodifications, such as N-terminal processing, C-terminal truncation,glycosylation, phosphorylation, etc. In one aspect, the maturepolypeptide is amino acids 1 to 428 of SEQ ID NO:2. It is known in theart that a host cell may produce a mixture of two of more differentmature polypeptides (i.e., with a different C-terminal and/or N-terminalamino acid) expressed by the same polynucleotide.

Mature polypeptide coding sequence: The term “mature polypeptide codingsequence” means a polynucleotide that encodes a mature polypeptidehaving cellulase activity. In one aspect, the mature polypeptide codingsequence is nucleotides 12 to 1295 of SEQ ID NO:1 or the genomic DNAsequence thereof.

Medium stringency conditions: The term “medium stringency conditions”means for probes of at least 100 nucleotides in length, prehybridizationand hybridization at 42° C. in 5×SSPE, 0.3% SDS, 200 micrograms/mlsheared and denatured salmon sperm DNA, and 35% formamide, followingstandard Southern blotting procedures for 12 to 24 hours. The carriermaterial is finally washed three times each for 15 minutes using 2×SSC,0.2% SDS at 55° C.

Medium-high stringency conditions: The term “medium-high stringencyconditions” means for probes of at least 100 nucleotides in length,prehybridization and hybridization at 42° C. in 5×SSPE, 0.3% SDS, 200micrograms/ml sheared and denatured salmon sperm DNA, and either 35%formamide, following standard Southern blotting procedures for 12 to 24hours. The carrier material is finally washed three times each for 15minutes using 2×SSC, 0.2% SDS at 60° C.

Nucleic acid construct: The term “nucleic acid construct” means anucleic acid molecule, either single- or double-stranded, which isisolated from a naturally occurring gene or is modified to containsegments of nucleic acids in a manner that would not otherwise exist innature or which is synthetic, which comprises one or more controlsequences.

Operably linked: The term “operably linked” means a configuration inwhich a control sequence is placed at an appropriate position relativeto the coding sequence of a polynucleotide such that the controlsequence directs the expression of the coding sequence.

Purified: The term “purified” means a polypeptide or polynucleotide thatis removed from at least one component with which it is naturallyassociated. For example, a polypeptide may be at least 1% pure, e.g., atleast 5% pure, at least 10% pure, at least 20% pure, at least 40% pure,at least 60% pure, at least 80% pure, at least 90% pure, or at least 95%pure, as determined by SDS-PAGE, and a polynucleotide may be at least 1%pure, e.g., at least 5% pure, at least 10% pure, at least 20% pure, atleast 40% pure, at least 60% pure, at least 80% pure, at least 90% pure,or at least 95% pure, as determined by agarose electrophoresis.

Sequence identity: The relatedness between two amino acid sequences orbetween two nucleotide sequences is described by the parameter “sequenceidentity”.

For purposes of the present invention, the sequence identity between twoamino acid sequences is determined using the Needleman-Wunsch algorithm(Needleman and Wunsch, 1970, J. Mol. Biol. 48: 443-453) for example asimplemented in the Needle program of the EMBOSS package (EMBOSS: TheEuropean Molecular Biology Open Software Suite, Rice et al., 2000,Trends Genet. 16: 276-277), preferably version 5.0.0 or later. Theparameters used are gap open penalty of 10, gap extension penalty of0.5, and the EBLOSUM62 (EMBOSS version of BLOSUM62) substitution matrix.The output of Needle labeled “longest identity” (obtained usingthe—nobrief option) is used as the percent identity and is calculated asfollows:

(Identical Residues×100)/(Length of Alignment−Total Number of Gaps inAlignment)

For purposes of the present invention, the sequence identity between twodeoxyribonucleotide sequences is determined using the Needleman-Wunschalgorithm (Needleman and Wunsch, 1970, supra) as implemented in theNeedle program of the EMBOSS package (EMBOSS: The European MolecularBiology Open Software Suite, Rice et al., 2000, supra), preferablyversion 5.0.0 or later. The parameters used are gap open penalty of 10,gap extension penalty of 0.5, and the EDNAFULL (EMBOSS version of NCBINUC4.4) substitution matrix. The output of Needle labeled “longestidentity” (obtained using the—nobrief option) is used as the percentidentity and is calculated as follows:

(Identical Deoxyribonucleotides×100)/(Length of Alignment−Total Numberof Gaps in Alignment)

Subsequence: The term “subsequence” means a polynucleotide having one ormore (e.g., several) nucleotides deleted from the 5′ and/or 3′ end of amature polypeptide coding sequence; wherein the subsequence encodes afragment having cellulase activity. In one aspect, a subsequencecontains at least 900 nucleotides (e.g., nucleotides 9 to 1293 of SEQ IDNO:1.

Variant: The term “variant” means a polypeptide having cellulaseactivity comprising an alteration, i.e., a substitution, insertion,and/or deletion of one or more (e.g., several) amino acid residues atone or more positions. A substitution means a replacement of the aminoacid occupying a position with a different amino acid; a deletion meansremoval of the amino acid occupying a position; and an insertion meansadding an amino acid adjacent to the amino acid occupying a position.

Very high stringency conditions: The term “very high stringencyconditions” means for probes of at least 100 nucleotides in length,prehybridization and hybridization at 42° C. in 5×SSPE, 0.3% SDS, 200micrograms/ml sheared and denatured salmon sperm DNA, and 50% formamide,following standard Southern blotting procedures for 12 to 24 hours. Thecarrier material is finally washed three times each for 15 minutes using2×SSC, 0.2% SDS at 70° C.

Very low stringency conditions: The term “very low stringencyconditions” means for probes of at least 100 nucleotides in length,prehybridization and hybridization at 42° C. in 5×SSPE, 0.3% SDS, 200micrograms/ml sheared and denatured salmon sperm DNA, and 25% formamide,following standard Southern blotting procedures for 12 to 24 hours. Thecarrier material is finally washed three times each for 15 minutes using2×SSC, 0.2% SDS at 45° C.

DETAILED DESCRIPTION OF THE INVENTION Polypeptides Having CellulaseActivity

In an embodiment, the present invention relates to isolated polypeptideshaving a sequence identity to the mature polypeptide of SEQ ID NO:2 ofat least 60%, e.g., at least 65%, at least 70%, at least 75%, at least80%, at least 85%, at least 90%, at least 91%, at least 92%, at least93%, at least 94%, at least 95%, at least 96%, at least 97%, at least98%, at least 99%, or 100%, which have cellulase activity. In oneaspect, the polypeptides differ by no more than ten amino acids, e.g.,nine amino acids, eight amino acids, seven amino acids, six amino acids,five amino acids, four amino acids, three amino acids, two amino acids,or one amino acid from the mature polypeptide of SEQ ID NO:2.

A polypeptide of the present invention preferably comprises or consistsof the amino acid sequence of SEQ ID NO:2 or an allelic variant thereof;or is a fragment thereof having cellulase activity. In another aspect,the polypeptide comprises or consists of the mature polypeptide of SEQID NO:2. In another preferred aspect, the polypeptide comprises orconsists of amino acids 1 to 428 of SEQ ID NO:2.

In another embodiment, the present invention relates to isolatedpolypeptides having cellulase activity that are encoded by apolynucleotide that hybridizes under very low stringency conditions, lowstringency conditions, medium stringency conditions, medium-highstringency conditions, high stringency conditions, or very highstringency conditions with (i) the mature polypeptide coding sequence ofSEQ ID NO:1, (ii) the genomic DNA sequence thereof, or (iii) thefull-length complement of (i) or (ii) (Sambrook et al., 1989, MolecularCloning, A Laboratory Manual, 2d edition, Cold Spring Harbor, N.Y.).

The polynucleotide of SEQ ID NO:1 or a subsequence thereof, as well asthe polypeptide of SEQ ID NO:2 or a fragment thereof, may be used todesign nucleic acid probes to identify and clone DNA encodingpolypeptides having cellulose activity from strains of different generaor species according to methods well known in the art. In particular,such probes can be used for hybridization with the genomic DNA or cDNAof a cell of interest, following standard Southern blotting procedures,in order to identify and isolate the corresponding gene therein. Suchprobes can be considerably shorter than the entire sequence, but shouldbe at least 15, e.g., at least 25, at least 35, or at least 70nucleotides in length. Preferably, the nucleic acid probe is at least100 nucleotides in length, e.g., at least 200 nucleotides, at least 300nucleotides, at least 400 nucleotides, at least 500 nucleotides, atleast 600 nucleotides, at least 700 nucleotides, at least 800nucleotides, or at least 900 nucleotides in length. Both DNA and RNAprobes can be used. The probes are typically labeled for detecting thecorresponding gene (for example, with ³²P, ³H, ³⁵S, biotin, or avidin).Such probes are encompassed by the present invention.

A genomic DNA or cDNA library prepared from such other strains may bescreened for DNA that hybridizes with the probes described above andencodes a polypeptide having cellulase activity. Genomic or other DNAfrom such other strains may be separated by agarose or polyacrylamidegel electrophoresis, or other separation techniques. DNA from thelibraries or the separated DNA may be transferred to and immobilized onnitrocellulose or other suitable carrier material. In order to identifya clone or DNA that is homologous with SEQ ID NO:1 or a subsequencethereof, the carrier material is preferably used in a Southern blot.

For purposes of the present invention, hybridization indicates that thepolynucleotide hybridizes to a labeled nucleic acid probe correspondingto (i) SEQ ID NO:1; (ii) the mature polypeptide coding sequence of SEQID NO:1; (iii) or the genomic DNA sequence thereof; (iv) the full-lengthcomplement thereof; or (v) a subsequence thereof; under very low to veryhigh stringency conditions. Molecules to which the nucleic acid probehybridizes under these conditions can be detected using, for example,X-ray film.

In one aspect, the nucleic acid probe is nucleotides 12 to 1295 of SEQID NO:1. In another aspect, the nucleic acid probe is a polynucleotidethat encodes the polypeptide of SEQ ID NO:2; the mature polypeptidethereof; or a fragment thereof. In another aspect, the nucleic acidprobe is SEQ ID NO:1 or the genomic sequence thereof.

For probes of at least 100 nucleotides in length, very low stringencyconditions are defined as prehybridization and hybridization at 42° C.in 5×SSPE, 0.3% SDS, 200 micrograms/ml sheared and denatured salmonsperm DNA, and 25% formamide, following standard Southern blottingprocedures for 12 to 24 hours optimally. The carrier material is finallywashed three times each for 15 minutes using 2×SSC, 0.2% SDS at 45° C.

For probes of at least 100 nucleotides in length, low stringencyconditions are defined as prehybridization and hybridization at 42° C.in 5×SSPE, 0.3% SDS, 200 micrograms/ml sheared and denatured salmonsperm DNA, and 25% formamide, following standard Southern blottingprocedures for 12 to 24 hours optimally. The carrier material is finallywashed three times each for 15 minutes using 2×SSC, 0.2% SDS at 50° C.

For probes of at least 100 nucleotides in length, medium stringencyconditions are defined as prehybridization and hybridization at 42° C.in 5×SSPE, 0.3% SDS, 200 micrograms/ml sheared and denatured salmonsperm DNA, and 35% formamide, following standard Southern blottingprocedures for 12 to 24 hours optimally. The carrier material is finallywashed three times each for 15 minutes using 2×SSC, 0.2% SDS at 55° C.

For probes of at least 100 nucleotides in length, medium-high stringencyconditions are defined as prehybridization and hybridization at 42° C.in 5×SSPE, 0.3% SDS, 200 micrograms/ml sheared and denatured salmonsperm DNA, and either 35% formamide, following standard Southernblotting procedures for 12 to 24 hours optimally. The carrier materialis finally washed three times each for 15 minutes using 2×SSC, 0.2% SDSat 60° C.

For probes of at least 100 nucleotides in length, high stringencyconditions are defined as prehybridization and hybridization at 42° C.in 5×SSPE, 0.3% SDS, 200 micrograms/ml sheared and denatured salmonsperm DNA, and 50% formamide, following standard Southern blottingprocedures for 12 to 24 hours optimally. The carrier material is finallywashed three times each for 15 minutes using 2×SSC, 0.2% SDS at 65° C.

For probes of at least 100 nucleotides in length, very high stringencyconditions are defined as prehybridization and hybridization at 42° C.in 5×SSPE, 0.3% SDS, 200 micrograms/ml sheared and denatured salmonsperm DNA, and 50% formamide, following standard Southern blottingprocedures for 12 to 24 hours optimally. The carrier material is finallywashed three times each for 15 minutes using 2×SSC, 0.2% SDS at 70° C.

In another embodiment, the present invention relates to isolatedpolypeptides having cellulase activity encoded by polynucleotides havinga sequence identity to the mature polypeptide coding sequence of SEQ IDNO:1 or the genomic DNA sequence thereof of at least 60%, e.g., at least65%, at least 70%, at least 75%, at least 80%, at least 85%, at least90%, at least 91%, at least 92%, at least 93%, at least 94%, at least95%, at least 96%, at least 97%, at least 98%, at least 99%, or 100%.

In another embodiment, the present invention relates to variants of themature polypeptide of SEQ ID NO:2 comprising a substitution, deletion,and/or insertion at one or more (e.g., several) positions. Preferably,amino acid changes are of a minor nature, that is conservative aminoacid substitutions or insertions that do not significantly affect thefolding and/or activity of the protein; small deletions, typically ofone to about 30 amino acids; small amino- or carboxyl-terminalextensions, such as an amino-terminal methionine residue; a small linkerpeptide of up to about 20-residues; or a small extension thatfacilitates purification by changing net charge or another function,such as a poly-histidine tract, an antigenic epitope or a bindingdomain.

Examples of conservative substitutions are within the groups of basicamino acids (arginine, lysine and histidine), acidic amino acids(glutamic acid and aspartic acid), polar amino acids (glutamine andasparagine), hydrophobic amino acids (leucine, isoleucine and valine),aromatic amino acids (phenylalanine, tryptophan and tyrosine), and smallamino acids (glycine, alanine, serine, threonine and methionine). Aminoacid substitutions that do not generally alter specific activity areknown in the art and are described, for example, by H. Neurath and R. L.Hill, 1979, In, The Proteins, Academic Press, New York. Commonsubstitutions are Ala/Ser, Val/Ile, Asp/Glu, Thr/Ser, Ala/Gly, Ala/Thr,Ser/Asn, Ala/Val, Ser/Gly, Tyr/Phe, Ala/Pro, Lys/Arg, Asp/Asn, Leu/Ile,Leu/Val, Ala/Glu, and Asp/Gly.

Alternatively, the amino acid changes are of such a nature that thephysico-chemical properties of the polypeptides are altered. Forexample, amino acid changes may improve the thermal stability of thepolypeptide, alter the substrate specificity, change the pH optimum, andthe like.

Essential amino acids in a polypeptide can be identified according toprocedures known in the art, such as site-directed mutagenesis oralanine-scanning mutagenesis (Cunningham and Wells, 1989, Science 244:1081-1085). In the latter technique, single alanine mutations areintroduced at every residue in the molecule, and the resultant mutantmolecules are tested for cellulase activity to identify amino acidresidues that are critical to the activity of the molecule. See also,Hilton et al., 1996, J. Biol. Chem. 271: 4699-4708. The active site ofthe enzyme or other biological interaction can also be determined byphysical analysis of structure, as determined by such techniques asnuclear magnetic resonance, crystallography, electron diffraction, orphotoaffinity labeling, in conjunction with mutation of putative contactsite amino acids. See, for example, de Vos et al., 1992, Science 255:306-312; Smith et al., 1992, J. Mol. Biol. 224: 899-904; Wlodaver etal., 1992, FEBS Lett. 309: 59-64. The identity of essential amino acidscan also be inferred from an alignment with a related polypeptide.

Essential amino acids in the sequence of amino acids 1 to 428 of SEQ IDNO: 2 are located in positions Asp56, Asp69 and Glu216. Consequentlythese positions should not be substituted or deleted.

Single or multiple amino acid substitutions, deletions, and/orinsertions can be made and tested using known methods of mutagenesis,recombination, and/or shuffling, followed by a relevant screeningprocedure, such as those disclosed by Reidhaar-Olson and Sauer, 1988,Science 241: 53-57; Bowie and Sauer, 1989, Proc. Natl. Acad. Sci. USA86: 2152-2156; WO 95/17413; or WO 95/22625. Other methods that can beused include error-prone PCR, phage display (e.g., Lowman et al., 1991,Biochemistry 30: 10832-10837; U.S. Pat. No. 5,223,409; WO 92/06204), andregion-directed mutagenesis (Derbyshire et al., 1986, Gene 46: 145; Neret al., 1988, DNA 7: 127).

Mutagenesis/shuffling methods can be combined with high-throughput,automated screening methods to detect activity of cloned, mutagenizedpolypeptides expressed by host cells (Ness et al., 1999, NatureBiotechnology 17: 893-896). Mutagenized DNA molecules that encode activepolypeptides can be recovered from the host cells and rapidly sequencedusing standard methods in the art. These methods allow the rapiddetermination of the importance of individual amino acid residues in apolypeptide.

In an embodiment, the number of amino acid substitutions, deletionsand/or insertions introduced into the mature polypeptide of SEQ ID NO:2is not more than 10, e.g., 1, 2, 3, 4, 5, 6, 7, 8 or 9.

The polypeptide may be a hybrid polypeptide in which a region of onepolypeptide is fused at the N-terminus or the C-terminus of a region ofanother polypeptide.

The polypeptide may be a fusion polypeptide or cleavable fusionpolypeptide in which another polypeptide is fused at the N-terminus orthe C-terminus of the polypeptide of the present invention. A fusionpolypeptide is produced by fusing a polynucleotide encoding anotherpolypeptide to a polynucleotide of the present invention. Techniques forproducing fusion polypeptides are known in the art, and include ligatingthe coding sequences encoding the polypeptides so that they are in frameand that expression of the fusion polypeptide is under control of thesame promoter(s) and terminator. Fusion polypeptides may also beconstructed using intein technology in which fusion polypeptides arecreated post-translationally (Cooper et al., 1993, EMBO J. 12:2575-2583; Dawson et al., 1994, Science 266: 776-779).

A fusion polypeptide can further comprise a cleavage site between thetwo polypeptides. Upon secretion of the fusion protein, the site iscleaved releasing the two polypeptides. Examples of cleavage sitesinclude, but are not limited to, the sites disclosed in Martin et al.,2003, J. Ind. Microbiol. Biotechnol. 3: 568-576; Svetina et al., 2000,J. Biotechnol. 76: 245-251; Rasmussen-Wilson et al., 1997, Appl.Environ. Microbiol. 63: 3488-3493; Ward et al., 1995, Biotechnology 13:498-503; and Contreras et al., 1991, Biotechnology 9: 378-381; Eaton etal., 1986, Biochemistry 25: 505-512; Collins-Racie et al., 1995,Biotechnology 13: 982-987; Carter et al., 1989, Proteins: Structure,Function, and Genetics 6: 240-248; and Stevens, 2003, Drug DiscoveryWorld 4: 35-48.

Sources of Polypeptides Having Cellulase Activity

A polypeptide having cellulase activity of the present invention may beobtained from microorganisms of any genus. For purposes of the presentinvention, the term “obtained from” as used herein in connection with agiven source shall mean that the polypeptide encoded by a polynucleotideis produced by the source or by a strain in which the polynucleotidefrom the source has been inserted. In one aspect, the polypeptideobtained from a given source is secreted extracellularly.

The polypeptide may be a bacterial polypeptide. For example, thepolypeptide may be a Gram-positive bacterial polypeptide such as aBacillus, Clostridium, Enterococcus, Geobacillus, Lactobacillus,Lactococcus, Oceanobacillus, Staphylococcus, Streptococcus, orStreptomyces polypeptide having cellulase activity, or a Gram-negativebacterial polypeptide such as a Campylobacter, E. coli, Flavobacterium,Fusobacterium, Helicobacter, Ilyobacter, Neisseria, Pseudomonas,Salmonella, or Ureaplasma polypeptide.

In one aspect, the polypeptide is a Bacillus alkalophilus, Bacillusamyloliquefaciens, Bacillus brevis, Bacillus circulans, Bacillusclausii, Bacillus coagulans, Bacillus firmus, Bacillus lautus, Bacilluslentus, Bacillus licheniformis, Bacillus megaterium, Bacillus pumilus,Bacillus stearothermophilus, Bacillus subtilis, or Bacillusthuringiensis polypeptide.

In another aspect, the polypeptide is a Streptococcus equisimilis,Streptococcus pyogenes, Streptococcus uberis, or Streptococcus equisubsp. Zooepidemicus polypeptide.

In another aspect, the polypeptide is a Streptomyces achromogenes,Streptomyces avermitilis, Streptomyces coelicolor, Streptomyces griseus,or Streptomyces lividans polypeptide.

The polypeptide may also be a fungal polypeptide. For example, thepolypeptide may be a yeast polypeptide such as a Candida, Kluyveromyces,Pichia, Saccharomyces, Schizosaccharomyces, or Yarrowia polypeptide; ora filamentous fungal polypeptide such as an Acremonium, Agaricus,Alternaria, Aspergillus, Aureobasidium, Botryospaeria, Ceriporiopsis,Chaetomidium, Chrysosporium, Claviceps, Cochliobolus, Coprinopsis,Coptotermes, Corynascus, Cryphonectria, Cryptococcus, Diplodia, Exidia,Filibasidium, Fusarium, Gibberella, Holomastigotoides, Humicola, Irpex,Lentinula, Leptospaeria, Magnaporthe, Melanocarpus, Meripilus, Mucor,Myceliophthora, Neocallimastix, Neurospora, Paecilomyces, Phanerochaete,Piromyces, Poitrasia, Pseudoplectania, Pseudotrichonympha, Rhizomucor,Schizophyllum, Scytalidium, Talaromyces, Thermoascus, Thielavia,Tolypocladium, Trichoderma, Trichophaea, Verticillium, Volvariella, orXylaria polypeptide.

In another aspect, the polypeptide is a Saccharomyces carlsbergensis,Saccharomyces cerevisiae, Saccharomyces diastaticus, Saccharomycesdouglasii, Saccharomyces kluyveri, Saccharomyces norbensis, orSaccharomyces oviformis polypeptide.

In another aspect, the polypeptide is an Acremonium cellulolyticus,Aspergillus aculeatus, Aspergillus awamori, Aspergillus foetidus,Aspergillus fumigatus, Aspergillus japonicus, Aspergillus nidulans,Aspergillus niger, Aspergillus oryzae, Chrysosporium inops,Chrysosporium keratinophilum, Chrysosporium lucknowense, Chrysosporiummerdarium, Chrysosporium pannicola, Chrysosporium queenslandicum,Chrysosporium tropicum, Chrysosporium zonatum, Fusarium bactridioides,Fusarium cerealis, Fusarium crookwellense, Fusarium culmorum, Fusariumgraminearum, Fusarium graminum, Fusarium heterosporum, Fusarium negundi,Fusarium oxysporum, Fusarium reticulatum, Fusarium roseum, Fusariumsambucinum, Fusarium sarcochroum, Fusarium sporotrichioides, Fusariumsulphureum, Fusarium torulosum, Fusarium trichothecioides, Fusariumvenenatum, Humicola grisea, Humicola insolens, Humicola lanuginosa,Irpex lacteus, Mucor miehei, Myceliophthora thermophila, Neurosporacrassa, Penicillium funiculosum, Penicillium purpurogenum, Phanerochaetechrysosporium, Thielavia achromatica, Thielavia albomyces, Thielaviaalbopilosa, Thielavia australeinsis, Thielavia fimeti, Thielaviamicrospora, Thielavia ovispora, Thielavia peruviana, Thielavia setosa,Thielavia spededonium, Thielavia subthermophila, Thielavia terrestris,Trichoderma harzianum, Trichoderma koningii, Trichodermalongibrachiatum, Trichoderma reesei, or Trichoderma viride polypeptide.

In other aspects the polypeptide is derived from a marine organism suchas a polychaete e.g. a Terebella species such as Terebella lapidaria.

It will be understood that for the aforementioned species the inventionencompasses both the perfect and imperfect states, and other taxonomicequivalents, e.g., anamorphs, regardless of the species name by whichthey are known. Those skilled in the art will readily recognize theidentity of appropriate equivalents.

Strains of these species are readily accessible to the public in anumber of culture collections, such as the American Type CultureCollection (ATCC), Deutsche Sammlung von Mikroorganismen andZellkulturen GmbH (DSMZ), Centraalbureau Voor Schimmelcultures (CBS),and Agricultural Research Service Patent Culture Collection, NorthernRegional Research Center (NRRL). The polypeptide may be identified andobtained from other sources including microorganisms isolated fromnature (e.g., soil, composts, water, etc.) using the above-mentionedprobes. Techniques for isolating microorganisms from natural habitatsare well known in the art. A polynucleotide encoding the polypeptide maythen be obtained by similarly screening a genomic DNA or cDNA library ofanother microorganism or mixed DNA sample. Once a polynucleotideencoding a polypeptide has been detected with the probe(s), thepolynucleotide can be isolated or cloned by utilizing techniques thatare well known to those of ordinary skill in the art (see, e.g.,Sambrook et al., 1989, supra).

Polynucleotides

The present invention also relates to isolated polynucleotides encodinga polypeptide of the present invention, as described above.

The techniques used to isolate or clone a polynucleotide encoding apolypeptide are known in the art and include isolation from genomic DNAor cDNA, or a combination thereof. The cloning of the polynucleotidesfrom genomic DNA can be effected, e.g., by using the well knownpolymerase chain reaction (PCR) or antibody screening of expressionlibraries to detect cloned DNA fragments with shared structuralfeatures. See, e.g., Innis et al., 1990, PCR: A Guide to Methods andApplication, Academic Press, New York. Other nucleic acid amplificationprocedures such as ligase chain reaction (LCR), ligation activatedtranscription (LAT) and polynucleotide-based amplification (NASBA) maybe used. The polynucleotides may be cloned from a strain of Terebella ora related organism and thus, for example, may be an allelic or speciesvariant of the polypeptide encoding region of the polynucleotide.

Modification of a polynucleotide encoding a polypeptide of the presentinvention may be necessary for synthesizing polypeptides substantiallysimilar to the polypeptide. The term “substantially similar” to thepolypeptide refers to non-naturally occurring forms of the polypeptide.These polypeptides may differ in some engineered way from thepolypeptide isolated from its native source, e.g., variants that differin specific activity, thermostability, pH optimum, or the like. Thevariants may be constructed on the basis of the polynucleotide presentedas the mature polypeptide coding sequence of SEQ ID NO:1 or the genimocDNA sequence thereof, e.g., a subsequence thereof, and/or byintroduction of nucleotide substitutions that do not result in a changein the amino acid sequence of the polypeptide, but which correspond tothe codon usage of the host organism intended for production of theenzyme, or by introduction of nucleotide substitutions that may giverise to a different amino acid sequence. For a general description ofnucleotide substitution, see, e.g., Ford et al., 1991, ProteinExpression and Purification 2: 95-107.

Nucleic Acid Constructs

The present invention also relates to nucleic acid constructs comprisinga polynucleotide of the present invention operably linked to one or morecontrol sequences that direct the expression of the coding sequence in asuitable host cell under conditions compatible with the controlsequences.

A polynucleotide may be manipulated in a variety of ways to provide forexpression of the polypeptide. Manipulation of the polynucleotide priorto its insertion into a vector may be desirable or necessary dependingon the expression vector. The techniques for modifying polynucleotidesutilizing recombinant DNA methods are well known in the art.

The control sequence may be a promoter sequence, a polynucleotide thatis recognized by a host cell for expression of a polynucleotide encodinga polypeptide of the present invention. The promoter sequence containstranscriptional control sequences that mediate the expression of thepolypeptide. The promoter may be any polynucleotide that showstranscriptional activity in the host cell of choice including mutant,truncated, and hybrid promoters, and may be obtained from genes encodingextracellular or intracellular polypeptides either homologous orheterologous to the host cell.

Examples of suitable promoters for directing transcription of thenucleic acid constructs of the present invention in a bacterial hostcell are the promoters obtained from the Bacillus amyloliquefaciensalpha-amylase gene (amyQ), Bacillus licheniformis alpha-amylase gene(amyL), Bacillus licheniformis penicillinase gene (penP), Bacillusstearothermophilus maltogenic amylase gene (amyM), Bacillus subtilislevansucrase gene (sacB), Bacillus subtilis xylA and xylB genes, E. colilac operon, E. coli trc promoter (Egon et al., 1988, Gene 69: 301-315),Streptomyces coelicolor agarase gene (dagA), and prokaryoticbeta-lactamase gene (VIIIa-Kamaroff et al., 1978, Proc. Natl. Acad. Sci.USA 75: 3727-3731), as well as the tac promoter (DeBoer et al., 1983,Proc. Natl. Acad. Sci. USA 80: 21-25). Further promoters are describedin “Useful proteins from recombinant bacteria” in Gilbert et al., 1980,Scientific American, 242: 74-94; and in Sambrook et al., 1989, supra.

Examples of suitable promoters for directing transcription of thenucleic acid constructs of the present invention in a filamentous fungalhost cell are promoters obtained from the genes for Aspergillus nidulansacetamidase, Aspergillus niger neutral alpha-amylase, Aspergillus nigeracid stable alpha-amylase, Aspergillus niger or Aspergillus awamoriglucoamylase (glaA), Aspergillus oryzae TAKA amylase, Aspergillus oryzaealkaline protease, Aspergillus oryzae triose phosphate isomerase,Fusarium oxysporum trypsin-like protease (WO 96/00787), Fusariumvenenatum amyloglucosidase (WO 00/56900), Fusarium venenatum Daria (WO00/56900), Fusarium venenatum Quinn (WO 00/56900), Rhizomucor mieheilipase, Rhizomucor miehei aspartic proteinase, Trichoderma reeseibeta-glucosidase, Trichoderma reesei cellobiohydrolase I, Trichodermareesei cellobiohydrolase II, Trichoderma reesei endoglucanase I,Trichoderma reesei endoglucanase II, Trichoderma reesei endoglucanaseIII, Trichoderma reesei endoglucanase IV, Trichoderma reeseiendoglucanase V, Trichoderma reesei xylanase I, Trichoderma reeseixylanase II, Trichoderma reesei beta-xylosidase, as well as the NA2-tpipromoter (a modified promoter from an Aspergillus gene encoding aneutral alpha-amylase in which the untranslated leader has been replacedby an untranslated leader from an Aspergillus gene encoding a triosephosphate isomerase; non-limiting examples include modified promotersfrom an Aspergillus niger gene encoding neutral alpha-amylase in whichthe untranslated leader has been replaced by an untranslated leader froman Aspergillus nidulans or Aspergillus oryzae gene encoding a triosephosphate isomerase); and mutant, truncated, and hybrid promotersthereof.

In a yeast host, useful promoters are obtained from the genes forSaccharomyces cerevisiae enolase (ENO-1), Saccharomyces cerevisiaegalactokinase (GAL1), Saccharomyces cerevisiae alcoholdehydrogenase/glyceraldehyde-3-phosphate dehydrogenase (ADH1, ADH2/GAP),Saccharomyces cerevisiae triose phosphate isomerase (TPI), Saccharomycescerevisiae metallothionein (CUP1), and Saccharomyces cerevisiae3-phosphoglycerate kinase. Other useful promoters for yeast host cellsare described by Romanos et al., 1992, Yeast 8: 423-488.

The control sequence may also be a suitable transcription terminatorsequence, which is recognized by a host cell to terminate transcription.The terminator sequence is operably linked to the 3′-terminus of thepolynucleotide encoding the polypeptide. Any terminator that isfunctional in the host cell of choice may be used in the presentinvention.

Preferred terminators for filamentous fungal host cells are obtainedfrom the genes for Aspergillus nidulans anthranilate synthase,Aspergillus niger glucoamylase, Aspergillus niger alpha-glucosidase,Aspergillus oryzae TAKA amylase, and Fusarium oxysporum trypsin-likeprotease.

Preferred terminators for yeast host cells are obtained from the genesfor Saccharomyces cerevisiae enolase, Saccharomyces cerevisiaecytochrome C (CYC1), and Saccharomyces cerevisiaeglyceraldehyde-3-phosphate dehydrogenase. Other useful terminators foryeast host cells are described by Romanos et al., 1992, supra.

The control sequence may also be a suitable leader sequence, whentranscribed is a nontranslated region of an mRNA that is important fortranslation by the host cell. The leader sequence is operably linked tothe 5′-terminus of the polynucleotide encoding the polypeptide. Anyleader sequence that is functional in the host cell of choice may beused.

Preferred leaders for filamentous fungal host cells are obtained fromthe genes for Aspergillus oryzae TAKA amylase and Aspergillus nidulanstriose phosphate isomerase.

Suitable leaders for yeast host cells are obtained from the genes forSaccharomyces cerevisiae enolase (ENO-1), Saccharomyces cerevisiae3-phosphoglycerate kinase, Saccharomyces cerevisiae alpha-factor, andSaccharomyces cerevisiae alcoholdehydrogenase/glyceraldehyde-3-phosphate dehydrogenase (ADH2/GAP).

The control sequence may also be a polyadenylation sequence, a sequenceoperably linked to the 3′-terminus of the polynucleotide and, whentranscribed, is recognized by the host cell as a signal to addpolyadenosine residues to transcribed mRNA. Any polyadenylation sequencethat is functional in the host cell of choice may be used.

Preferred polyadenylation sequences for filamentous fungal host cellsare obtained from the genes for Aspergillus oryzae TAKA amylase,Aspergillus niger glucoamylase, Aspergillus nidulans anthranilatesynthase, Fusarium oxysporum trypsin-like protease, and Aspergillusniger alpha-glucosidase.

Useful polyadenylation sequences for yeast host cells are described byGuo and Sherman, 1995, Mol. Cellular. Biol. 15: 5983-5990.

The control sequence may also be a signal peptide coding region thatencodes a signal peptide linked to the N-terminus of a polypeptide anddirects the polypeptide into the cell's secretory pathway. The 5′-end ofthe coding sequence of the polynucleotide may inherently contain asignal peptide coding sequence naturally linked in translation readingframe with the segment of the coding sequence that encodes thepolypeptide. Alternatively, the 5′-end of the coding sequence maycontain a signal peptide coding sequence that is foreign to the codingsequence. A foreign signal peptide coding sequence may be required wherethe coding sequence does not naturally contain a signal peptide codingsequence. Alternatively, a foreign signal peptide coding sequence maysimply replace the natural signal peptide coding sequence in order toenhance secretion of the polypeptide. However, any signal peptide codingsequence that directs the expressed polypeptide into the secretorypathway of a host cell of choice may be used.

Effective signal peptide coding sequences for bacterial host cells arethe signal peptide coding sequences obtained from the genes for BacillusNCIB 11837 maltogenic amylase, Bacillus licheniformis subtilisin,Bacillus licheniformis beta-lactamase, Bacillus stearothermophilusalpha-amylase, Bacillus stearothermophilus neutral proteases (nprT,nprS, nprM), and Bacillus subtilis prsA. Further signal peptides aredescribed by Simonen and Palva, 1993, Microbiological Reviews 57:109-137.

Effective signal peptide coding sequences for filamentous fungal hostcells are the signal peptide coding sequences obtained from the genesfor Aspergillus niger neutral amylase, Aspergillus niger glucoamylase,Aspergillus oryzae TAKA amylase, Humicola insolens cellulase, Humicolainsolens endoglucanase V, Humicola lanuginosa lipase, and Rhizomucormiehei aspartic proteinase.

Useful signal peptides for yeast host cells are obtained from the genesfor Saccharomyces cerevisiae alpha-factor and Saccharomyces cerevisiaeinvertase. Other useful signal peptide coding sequences are described byRomanos et al., 1992, supra.

The control sequence may also be a propeptide coding sequence thatencodes a propeptide positioned at the N-terminus of a polypeptide. Theresultant polypeptide is known as a proenzyme or propolypeptide (or azymogen in some cases). A propolypeptide is generally inactive and canbe converted to an active polypeptide by catalytic or autocatalyticcleavage of the propeptide from the propolypeptide. The propeptidecoding sequence may be obtained from the genes for Bacillus subtilisalkaline protease (aprE), Bacillus subtilis neutral protease (nprT),Myceliophthora thermophila laccase (WO 95/33836), Rhizomucor mieheiaspartic proteinase, and Saccharomyces cerevisiae alpha-factor.

Where both signal peptide and propeptide sequences are present at theN-terminus of a polypeptide, the propeptide sequence is positioned nextto the N-terminus of a polypeptide and the signal peptide sequence ispositioned next to the N-terminus of the propeptide sequence.

It may also be desirable to add regulatory sequences that regulateexpression of the polypeptide relative to the growth of the host cell.Examples of regulatory systems are those that cause expression of thegene to be turned on or off in response to a chemical or physicalstimulus, including the presence of a regulatory compound. Regulatorysystems in prokaryotic systems include the lac, tac, and trp operatorsystems. In yeast, the ADH2 system or GAL1 system may be used. Infilamentous fungi, the Aspergillus niger glucoamylase promoter,Aspergillus oryzae TAKA alpha-amylase promoter, and Aspergillus oryzaeglucoamylase promoter may be used. Other examples of regulatorysequences are those that allow for gene amplification. In eukaryoticsystems, these regulatory sequences include the dihydrofolate reductasegene that is amplified in the presence of methotrexate, and themetallothionein genes that are amplified with heavy metals. In thesecases, the polynucleotide encoding the polypeptide would be operablylinked with the regulatory sequence.

Expression Vectors

The present invention also relates to recombinant expression vectorscomprising a polynucleotide of the present invention, a promoter, andtranscriptional and translational stop signals. The various nucleotideand control sequences may be joined together to produce a recombinantexpression vector that may include one or more convenient restrictionsites to allow for insertion or substitution of the polynucleotideencoding the polypeptide at such sites. Alternatively, thepolynucleotide may be expressed by inserting the polynucleotide or anucleic acid construct comprising the sequence into an appropriatevector for expression. In creating the expression vector, the codingsequence is located in the vector so that the coding sequence isoperably linked with the appropriate control sequences for expression.

The recombinant expression vector may be any vector (e.g., a plasmid orvirus) that can be conveniently subjected to recombinant DNA proceduresand can bring about expression of the polynucleotide. The choice of thevector will typically depend on the compatibility of the vector with thehost cell into which the vector is to be introduced. The vector may be alinear or closed circular plasmid.

The vector may be an autonomously replicating vector, i.e., a vectorthat exists as an extrachromosomal entity, the replication of which isindependent of chromosomal replication, e.g., a plasmid, anextrachromosomal element, a minichromosome, or an artificial chromosome.The vector may contain any means for assuring self-replication.Alternatively, the vector may be one that, when introduced into the hostcell, is integrated into the genome and replicated together with thechromosome(s) into which it has been integrated. Furthermore, a singlevector or plasmid or two or more vectors or plasmids that togethercontain the total DNA to be introduced into the genome of the host cell,or a transposon, may be used.

The vector preferably contains one or more selectable markers thatpermit easy selection of transformed, transfected, transduced, or thelike cells. A selectable marker is a gene the product of which providesfor biocide or viral resistance, resistance to heavy metals, prototrophyto auxotrophs, and the like.

Examples of bacterial selectable markers are the dal genes from Bacillussubtilis or Bacillus licheniformis, or markers that confer antibioticresistance such as ampicillin, chloramphenicol, kanamycin, ortetracycline resistance. Suitable markers for yeast host cells are ADE2,HIS3, LEU2, LYS2, MET3, TRP1, and URA3. Selectable markers for use in afilamentous fungal host cell include, but are not limited to, amdS(acetamidase), argB (ornithine carbamoyltransferase), bar(phosphinothricin acetyltransferase), hph (hygromycinphosphotransferase), niaD (nitrate reductase), pyrG(orotidine-5′-phosphate decarboxylase), sC (sulfate adenyltransferase),and trpC (anthranilate synthase), as well as equivalents thereof.Preferred for use in an Aspergillus cell are Aspergillus nidulans orAspergillus oryzae amdS and pyrG genes and a Streptomyces hygroscopicusbargene.

The vector preferably contains an element(s) that permits integration ofthe vector into the host cell's genome or autonomous replication of thevector in the cell independent of the genome.

For integration into the host cell genome, the vector may rely on thepolynucleotide's sequence encoding the polypeptide or any other elementof the vector for integration into the genome by homologous ornon-homologous recombination. Alternatively, the vector may containadditional polynucleotides for directing integration by homologousrecombination into the genome of the host cell at a precise location(s)in the chromosome(s). To increase the likelihood of integration at aprecise location, the integrational elements should contain a sufficientnumber of nucleic acids, such as 100 to 10,000 base pairs, 400 to 10,000base pairs, and 800 to 10,000 base pairs, which have a high degree ofsequence identity to the corresponding target sequence to enhance theprobability of homologous recombination. The integrational elements maybe any sequence that is homologous with the target sequence in thegenome of the host cell. Furthermore, the integrational elements may benon-encoding or encoding polynucleotides. On the other hand, the vectormay be integrated into the genome of the host cell by non-homologousrecombination.

For autonomous replication, the vector may further comprise an origin ofreplication enabling the vector to replicate autonomously in the hostcell in question. The origin of replication may be any plasmidreplicator mediating autonomous replication that functions in a cell.The term “origin of replication” or “plasmid replicator” means apolynucleotide that enables a plasmid or vector to replicate in vivo.

Examples of bacterial origins of replication are the origins ofreplication of plasmids pBR322, pUC19, pACYC177, and pACYC184 permittingreplication in E. coli, and pUB110, pE194, pTA1060, and pAMβ1 permittingreplication in Bacillus.

Examples of origins of replication for use in a yeast host cell are the2 micron origin of replication, ARS1, ARS4, the combination of ARS1 andCEN3, and the combination of ARS4 and CEN6.

Examples of origins of replication useful in a filamentous fungal cellare AMA1 and ANS1 (Gems et al., 1991, Gene 98: 61-67; Cullen et al.,1987, Nucleic Acids Res. 15: 9163-9175; WO 00/24883). Isolation of theAMA1 gene and construction of plasmids or vectors comprising the genecan be accomplished according to the methods disclosed in WO 00/24883.

More than one copy of a polynucleotide of the present invention may beinserted into a host cell to increase production of a polypeptide. Anincrease in the copy number of the polynucleotide can be obtained byintegrating at least one additional copy of the sequence into the hostcell genome or by including an amplifiable selectable marker gene withthe polynucleotide where cells containing amplified copies of theselectable marker gene, and thereby additional copies of thepolynucleotide, can be selected for by cultivating the cells in thepresence of the appropriate selectable agent.

The procedures used to ligate the elements described above to constructthe recombinant expression vectors of the present invention are wellknown to one skilled in the art (see, e.g., Sambrook et al., 1989,supra).

Host Cells

The present invention also relates to recombinant host cells, comprisinga polynucleotide of the present invention operably linked to one or morecontrol sequences that direct the production of a polypeptide of thepresent invention. A construct or vector comprising a polynucleotide isintroduced into a host cell so that the construct or vector ismaintained as a chromosomal integrant or as a self-replicatingextra-chromosomal vector as described earlier. The term “host cell”encompasses any progeny of a parent cell that is not identical to theparent cell due to mutations that occur during replication. The choiceof a host cell will to a large extent depend upon the gene encoding thepolypeptide and its source.

The host cell may be any cell useful in the recombinant production of apolypeptide of the present invention, e.g., a prokaryote or a eukaryote.

The prokaryotic host cell may be any Gram-positive or Gram-negativebacterium. Gram-positive bacteria include, but not limited to, Bacillus,Clostridium, Enterococcus, Geobacillus, Lactobacillus, Lactococcus,Oceanobacillus, Staphylococcus, Streptococcus, and Streptomyces.Gram-negative bacteria include, but not limited to, Campylobacter, E.coli, Flavobacterium, Fusobacterium, Helicobacter, Ilyobacter,Neisseria, Pseudomonas, Salmonella, and Ureaplasma.

The bacterial host cell may be any Bacillus cell including, but notlimited to, Bacillus alkalophilus, Bacillus amyloliquefaciens, Bacillusbrevis, Bacillus circulans, Bacillus clausll, Bacillus coagulans,Bacillus firmus, Bacillus lautus, Bacillus lentus, Bacilluslicheniformis, Bacillus megaterium, Bacillus pumilus, Bacillusstearothermophilus, Bacillus subtilis, and Bacillus thuringiensis cells.

The bacterial host cell may also be any Streptococcus cell including,but not limited to, Streptococcus equisimilis, Streptococcus pyogenes,Streptococcus uberis, and Streptococcus equi subsp. Zooepidemicus cells.

The bacterial host cell may also be any Streptomyces cell including, butnot limited to, Streptomyces achromogenes, Streptomyces avermitilis,Streptomyces coelicolor, Streptomyces griseus, and Streptomyces lividanscells.

The introduction of DNA into a Bacillus cell may be effected byprotoplast transformation (see, e.g., Chang and Cohen, 1979, Mol. Gen.Genet. 168: 111-115), using competent cells (see, e.g., Young andSpizizen, 1961, J. Bacteriol. 81: 823-829, or Dubnau andDavidoff-Abelson, 1971, J. Mol. Biol. 56: 209-221), electroporation(see, e.g., Shigekawa and Dower, 1988, Biotechniques 6: 742-751), orconjugation (see, e.g., Koehler and Thorne, 1987, J. Bacteriol. 169:5271-5278). The introduction of DNA into an E. coli cell may be effectedby protoplast transformation (see, e.g., Hanahan, 1983, J. Mol. Biol.166: 557-580) or electroporation (see, e.g., Dower et al., 1988, NucleicAcids Res. 16: 6127-6145). The introduction of DNA into a Streptomycescell may be effected by protoplast transformation and electroporation(see, e.g., Gong et al., 2004, Folia Microbiol. (Praha) 49: 399-405),conjugation (see, e.g., Mazodier et al., 1989, J. Bacteriol. 171:3583-3585), or transduction (see, e.g., Burke et al., 2001, Proc. Natl.Acad. Sci. USA 98: 6289-6294). The introduction of DNA into aPseudomonas cell may be effected by electroporation (see, e.g., Choi etal., 2006, J. Microbiol. Methods 64: 391-397) or conjugation (see, e.g.,Pinedo and Smets, 2005, Appl. Environ. Microbiol. 71: 51-57). Theintroduction of DNA into a Streptococcus cell may be effected by naturalcompetence (see, e.g., Perry and Kuramitsu, 1981, Infect. Immun. 32:1295-1297), protoplast transformation (see, e.g., Catt and Jollick,1991, Microbios 68: 189-207), electroporation (see, e.g., Buckley etal., 1999, Appl. Environ. Microbiol. 65: 3800-3804) or conjugation (see,e.g., Clewell, 1981, Microbiol. Rev. 45: 409-436). However, any methodknown in the art for introducing DNA into a host cell can be used.

The host cell may also be a eukaryote, such as a mammalian, insect,plant, or fungal cell.

The host cell may be a fungal cell. “Fungi” as used herein includes thephyla Ascomycota, Basidiomycota, Chytridiomycota, and Zygomycota (asdefined by Hawksworth et al., In, Ainsworth and Bisby's Dictionary ofThe Fungi, 8th edition, 1995, CAB International, University Press,Cambridge, UK) as well as the Oomycota (as cited in Hawksworth et al.,1995, supra, page 171) and all mitosporic fungi (Hawksworth et al.,1995, supra).

The fungal host cell may be a yeast cell. “Yeast” as used hereinincludes ascosporogenous yeast (Endomycetales), basidiosporogenousyeast, and yeast belonging to the Fungi Imperfecti (Blastomycetes).Since the classification of yeast may change in the future, for thepurposes of this invention, yeast shall be defined as described inBiology and Activities of Yeast (Skinner, F. A., Passmore, S. M., andDavenport, R. R., eds, Soc. App. Bacteriol. Symposium Series No. 9,1980).

The yeast host cell may be a Candida, Hansenula, Kluyveromyces, Pichia,Saccharomyces, Schizosaccharomyces, or Yarrowia cell such as aKluyveromyces lactis, Saccharomyces carlsbergensis, Saccharomycescerevisiae, Saccharomyces diastaticus, Saccharomyces douglasii,Saccharomyces kluyveri, Saccharomyces norbensis, Saccharomycesoviformis, or Yarrowia lipolytica cell.

The fungal host cell may be a filamentous fungal cell. “Filamentousfungi” include all filamentous forms of the subdivision Eumycota andOomycota (as defined by Hawksworth et al., 1995, supra). The filamentousfungi are generally characterized by a mycelial wall composed of chitin,cellulose, glucan, chitosan, mannan, and other complex polysaccharides.Vegetative growth is by hyphal elongation and carbon catabolism isobligately aerobic. In contrast, vegetative growth by yeasts such asSaccharomyces cerevisiae is by budding of a unicellular thallus andcarbon catabolism may be fermentative.

The filamentous fungal host cell may be an Acremonium, Aspergillus,Aureobasidium, Bjerkandera, Ceriporiopsis, Chrysosporium, Coprinus,Coriolus, Cryptococcus, Filibasidium, Fusarium, Humicola, Magnaporthe,Mucor, Myceliophthora, Neocallimastix, Neurospora, Paecilomyces,Penicillium, Phanerochaete, Phlebia, Piromyces, Pleurotus,Schizophyllum, Talaromyces, Thermoascus, Thielavia, Tolypocladium,Trametes, or Trichoderma cell.

For example, the filamentous fungal host cell may be an Aspergillusawamori, Aspergillus foetidus, Aspergillus fumigatus, Aspergillusjaponicus, Aspergillus nidulans, Aspergillus niger, Aspergillus oryzae,Bjerkandera adusta, Ceriporiopsis aneirina, Ceriporiopsis caregiea,Ceriporiopsis gilvescens, Ceriporiopsis pannocinta, Ceriporiopsisrivulosa, Ceriporiopsis subrufa, Ceriporiopsis subvermispora,Chrysosporium inops, Chrysosporium keratinophilum, Chrysosporiumlucknowense, Chrysosporium merdarium, Chrysosporium pannicola,Chrysosporium queenslandicum, Chrysosporium tropicum, Chrysosporiumzonatum, Coprinus cinereus, Coriolus hirsutus, Fusarium bactridioides,Fusarium cerealis, Fusarium crookwellense, Fusarium culmorum, Fusariumgraminearum, Fusarium graminum, Fusarium heterosporum, Fusarium negundi,Fusarium oxysporum, Fusarium reticulatum, Fusarium roseum, Fusariumsambucinum, Fusarium sarcochroum, Fusarium sporotrichioides, Fusariumsulphureum, Fusarium torulosum, Fusarium trichothecioides, Fusariumvenenatum, Humicola insolens, Humicola lanuginosa, Mucor miehei,Myceliophthora thermophila, Neurospora crassa, Penicillium purpurogenum,Phanerochaete chrysosporium, Phlebia radiata, Pleurotus eryngii,Thielavia terrestris, Trametes villosa, Trametes versicolor, Trichodermaharzianum, Trichoderma koningii, Trichoderma longibrachiatum,Trichoderma reesei, or Trichoderma viride cell.

Fungal cells may be transformed by a process involving protoplastformation, transformation of the protoplasts, and regeneration of thecell wall in a manner known per se. Suitable procedures fortransformation of Aspergillus and Trichoderma host cells are describedin EP 238023, Yelton et al., 1984, Proc. Natl. Acad. Sci. USA 81:1470-1474, and Christensen et al., 1988, Bio/Technology 6: 1419-1422.Suitable methods for transforming Fusarium species are described byMalardier et al., 1989, Gene 78: 147-156, and WO 96/00787. Yeast may betransformed using the procedures described by Becker and Guarente, InAbelson, J. N. and Simon, M. I., editors, Guide to Yeast Genetics andMolecular Biology, Methods in Enzymology, Volume 194, pp 182-187,Academic Press, Inc., New York; Ito et al., 1983, J. Bacteriol. 153:163; and Hinnen et al., 1978, Proc. Natl. Acad. Sci. USA 75: 1920.

Methods of Production

Based on the nucleotide sequence identified as SEQ ID NO: 1, a syntheticgene may be obtained from a number of vendors such as Gene Art (GENEARTAG BioPark, Josef-Engert-Str. 11, 93053, Regensburg, Germany) or DNA 2.0(DNA2.0, 1430 O'Brien Drive, Suite E, Menlo Park, Calif. 94025, USA. Thesynthetic gene may be designed to incorporate additional DNA sequencessuch as restriction sites or homologous recombination regions tofacilitate direct cloning into an expression vector.

Alternatively, using synthetic oligonucleotide primers such as (SEQ IDNO: 3) and (SEQ ID NO: 4), a simple PCR reaction can be used to amplifythe full-length open reading frame from the gene of SEQ ID NO: 1. Thegene can then be cloned into an expression vector for example asdescribed above and expressed in a host cell, for example in Aspergillusoryzae.

The present invention also relates to methods of producing a polypeptideof the present invention, comprising: (a) cultivating a cell, which inits wild-type form produces the polypeptide, under conditions conducivefor production of the polypeptide; and (b) recovering the polypeptide.In a preferred aspect, the cell is a Terebella cell. In a more preferredaspect, the cell is a Terebella lapidaria cell.

The present invention also relates to methods of producing a polypeptideof the present invention, comprising: (a) cultivating a recombinant hostcell of the present invention under conditions conducive for productionof the polypeptide; and (b) recovering the polypeptide.

The host cells are cultivated in a nutrient medium suitable forproduction of the polypeptide using methods well known in the art. Forexample, the cell may be cultivated by shake flask cultivation, andsmall-scale or large-scale fermentation (including continuous, batch,fed-batch, or solid state fermentations) in laboratory or industrialfermentors performed in a suitable medium and under conditions allowingthe polypeptide to be expressed and/or isolated. The cultivation takesplace in a suitable nutrient medium comprising carbon and nitrogensources and inorganic salts, using procedures known in the art. Suitablemedia are available from commercial suppliers or may be preparedaccording to published compositions (e.g., in catalogues of the AmericanType Culture Collection). If the polypeptide is secreted into thenutrient medium, the polypeptide can be recovered directly from themedium. If the polypeptide is not secreted, it can be recovered fromcell lysates.

The polypeptide may be detected using methods known in the art that arespecific for the polypeptides. These detection methods may include useof specific antibodies, formation of an enzyme product, or disappearanceof an enzyme substrate. For example, an enzyme assay may be used todetermine the activity of the polypeptide.

The polypeptide may be recovered using methods known in the art. Forexample, the polypeptide may be recovered from the nutrient medium byconventional procedures including, but not limited to, centrifugation,filtration, extraction, spray-drying, evaporation, or precipitation.

The polypeptide may be purified by a variety of procedures known in theart including, but not limited to, chromatography (e.g., ion exchange,affinity, hydrophobic, chromatofocusing, and size exclusion),electrophoretic procedures (e.g., preparative isoelectric focusing),differential solubility (e.g., ammonium sulfate precipitation),SDS-PAGE, or extraction (see, e.g., Protein Purification, J.-C. Jansonand Lars Ryden, editors, VCH Publishers, New York, 1989) to obtainsubstantially pure polypeptides.

In an alternative aspect, the polypeptide is not recovered, but rather ahost cell of the present invention expressing the polypeptide is used asa source of the polypeptide.

Plants

The present invention also relates to isolated plants, e.g., atransgenic plant, plant part, or plant cell, comprising a polynucleotideof the present invention so as to express and produce a polypeptide ordomain in recoverable quantities. The polypeptide or domain may berecovered from the plant or plant part. Alternatively, the plant orplant part containing the polypeptide or domain may be used as such forimproving the quality of a food or feed, e.g., improving nutritionalvalue, palatability, and rheological properties, or to destroy anantinutritive factor.

The transgenic plant can be dicotyledonous (a dicot) or monocotyledonous(a monocot). Examples of monocot plants are grasses, such as meadowgrass (blue grass, Poa), forage grass such as Festuca, Lolium, temperategrass, such as Agrostis, and cereals, e.g., wheat, oats, rye, barley,rice, sorghum, and maize (corn).

Examples of dicot plants are tobacco, legumes, such as lupins, potato,sugar beet, pea, bean and soybean, and cruciferous plants (familyBrassicaceae), such as cauliflower, rape seed, and the closely relatedmodel organism Arabidopsis thaliana.

Examples of plant parts are stem, callus, leaves, root, fruits, seeds,and tubers as well as the individual tissues comprising these parts,e.g., epidermis, mesophyll, parenchyme, vascular tissues, meristems.Specific plant cell compartments, such as chloroplasts, apoplasts,mitochondria, vacuoles, peroxisomes and cytoplasm are also considered tobe a plant part. Furthermore, any plant cell, whatever the tissueorigin, is considered to be a plant part. Likewise, plant parts such asspecific tissues and cells isolated to facilitate the utilization of theinvention are also considered plant parts, e.g., embryos, endosperms,aleurone and seed coats.

Also included within the scope of the present invention are the progenyof such plants, plant parts, and plant cells.

The transgenic plant or plant cell expressing the polypeptide or domainmay be constructed in accordance with methods known in the art. Inshort, the plant or plant cell is constructed by incorporating one ormore expression constructs encoding the polypeptide or domain into theplant host genome or chloroplast genome and propagating the resultingmodified plant or plant cell into a transgenic plant or plant cell.

The expression construct is conveniently a nucleic acid construct thatcomprises a polynucleotide encoding a polypeptide or domain operablylinked with appropriate regulatory sequences required for expression ofthe polynucleotide in the plant or plant part of choice. Furthermore,the expression construct may comprise a selectable marker useful foridentifying host cells into which the expression construct has beenintegrated and DNA sequences necessary for introduction of the constructinto the plant in question (the latter depends on the DNA introductionmethod to be used).

The choice of regulatory sequences, such as promoter and terminatorsequences and optionally signal or transit sequences, is determined, forexample, on the basis of when, where, and how the polypeptide or domainis desired to be expressed. For instance, the expression of the geneencoding a polypeptide or domain may be constitutive or inducible, ormay be developmental, stage or tissue specific, and the gene product maybe targeted to a specific tissue or plant part such as seeds or leaves.Regulatory sequences are, for example, described by Tague et al., 1988,Plant Physiology 86: 506.

For constitutive expression, the ³⁵S-CaMV, the maize ubiquitin 1, or therice actin 1 promoter may be used (Franck et al., 1980, Cell 21:285-294; Christensen et al., 1992, Plant Mol. Biol. 18: 675-689; Zhanget al., 1991, Plant Cell 3: 1155-1165). Organ-specific promoters may be,for example, a promoter from storage sink tissues such as seeds, potatotubers, and fruits (Edwards and Coruzzi, 1990, Ann. Rev. Genet. 24:275-303), or from metabolic sink tissues such as meristems (Ito et al.,1994, Plant Mol. Biol. 24: 863-878), a seed specific promoter such asthe glutelin, prolamin, globulin, or albumin promoter from rice (Wu etal., 1998, Plant Cell Physiol. 39: 885-889), a Vicia faba promoter fromthe legumin B4 and the unknown seed protein gene from Vicia faba (Conradet al., 1998, J. Plant Physiol. 152: 708-711), a promoter from a seedoil body protein (Chen et al., 1998, Plant Cell Physiol. 39: 935-941),the storage protein napA promoter from Brassica napus, or any other seedspecific promoter known in the art, e.g., as described in WO 91/14772.Furthermore, the promoter may be a leaf specific promoter such as therbcs promoter from rice or tomato (Kyozuka et al., 1993, Plant Physiol.102: 991-1000), the chlorella virus adenine methyltransferase genepromoter (Mitra and Higgins, 1994, Plant Mol. Biol. 26: 85-93), the aldPgene promoter from rice (Kagaya et al., 1995, Mol. Gen. Genet. 248:668-674), or a wound inducible promoter such as the potato pin2 promoter(Xu et al., 1993, Plant Mol. Biol. 22: 573-588). Likewise, the promotermay be induced by abiotic treatments such as temperature, drought, oralterations in salinity or induced by exogenously applied substancesthat activate the promoter, e.g., ethanol, oestrogens, plant hormonessuch as ethylene, abscisic acid, and gibberellic acid, and heavy metals.

A promoter enhancer element may also be used to achieve higherexpression of a polypeptide or domain in the plant. For instance, thepromoter enhancer element may be an intron that is placed between thepromoter and the polynucleotide encoding a polypeptide or domain. Forinstance, Xu et al., 1993, supra, disclose the use of the first intronof the rice actin 1 gene to enhance expression.

The selectable marker gene and any other parts of the expressionconstruct may be chosen from those available in the art.

The nucleic acid construct is incorporated into the plant genomeaccording to conventional techniques known in the art, includingAgrobacterium-mediated transformation, virus-mediated transformation,microinjection, particle bombardment, biolistic transformation, andelectroporation (Gasser et al., 1990, Science 244: 1293; Potrykus, 1990,Bio/Technology 8: 535; Shimamoto et al., 1989, Nature 338: 274).

Presently, Agrobacterium tumefaciens-mediated gene transfer is themethod of choice for generating transgenic dicots (for a review, seeHooykas and Schilperoort, 1992, Plant Mol. Biol. 19: 15-38) and can alsobe used for transforming monocots, although other transformation methodsare often used for these plants. Presently, the method of choice forgenerating transgenic monocots is particle bombardment (microscopic goldor tungsten particles coated with the transforming DNA) of embryoniccalli or developing embryos (Christou, 1992, Plant J. 2: 275-281;Shimamoto, 1994, Curr. Opin. Biotechnol. 5: 158-162; Vasil et al., 1992,Bio/Technology 10: 667-674). An alternative method for transformation ofmonocots is based on protoplast transformation as described by Omirullehet al., 1993, Plant Mol. Biol. 21: 415-428. Additional transformationmethods for use in accordance with the present disclosure include thosedescribed in U.S. Pat. Nos. 6,395,966 and 7,151,204 (both of which areherein incorporated by reference in their entirety).

Following transformation, the transformants having incorporated theexpression construct are selected and regenerated into whole plantsaccording to methods well known in the art. Often the transformationprocedure is designed for the selective elimination of selection geneseither during regeneration or in the following generations by using, forexample, co-transformation with two separate T-DNA constructs or sitespecific excision of the selection gene by a specific recombinase.

In addition to direct transformation of a particular plant genotype witha construct of the present invention, transgenic plants may be made bycrossing a plant having the construct to a second plant lacking theconstruct. For example, a construct encoding a polypeptide or domain canbe introduced into a particular plant variety by crossing, without theneed for ever directly transforming a plant of that given variety.Therefore, the present invention encompasses not only a plant directlyregenerated from cells which have been transformed in accordance withthe present invention, but also the progeny of such plants. As usedherein, progeny may refer to the offspring of any generation of a parentplant prepared in accordance with the present invention. Such progenymay include a DNA construct prepared in accordance with the presentinvention. Crossing results in the introduction of a transgene into aplant line by cross pollinating a starting line with a donor plant line.Non-limiting examples of such steps are described in U.S. Pat. No.7,151,204.

Plants may be generated through a process of backcross conversion. Forexample, plants include plants referred to as a backcross convertedgenotype, line, inbred, or hybrid.

Genetic markers may be used to assist in the introgression of one ormore transgenes of the invention from one genetic background intoanother. Marker assisted selection offers advantages relative toconventional breeding in that it can be used to avoid errors caused byphenotypic variations. Further, genetic markers may provide dataregarding the relative degree of elite germplasm in the individualprogeny of a particular cross. For example, when a plant with a desiredtrait which otherwise has a non-agronomically desirable geneticbackground is crossed to an elite parent, genetic markers may be used toselect progeny which not only possess the trait of interest, but alsohave a relatively large proportion of the desired germplasm. In thisway, the number of generations required to introgress one or more traitsinto a particular genetic background is minimized.

The present invention also relates to methods of producing a polypeptideor domain of the present invention comprising: (a) cultivating atransgenic plant or a plant cell comprising a polynucleotide encodingthe polypeptide or domain under conditions conducive for production ofthe polypeptide or domain; and (b) recovering the polypeptide or domain.

Detergent Compositions

In one embodiment, the invention is directed to detergent compositionscomprising an enzyme of the present invention in combination with one ormore additional cleaning composition components. The choice ofadditional components is within the skill of the artisan and includesconventional ingredients, including the exemplary non-limitingcomponents set forth below.

The choice of components may include, for textile care, theconsideration of the type of textile to be cleaned, the type and/ordegree of soiling, the temperature at which cleaning is to take place,and the formulation of the detergent product. Although componentsmentioned below are categorized by general header according to aparticular functionality, this is not to be construed as a limitation,as a component may comprise additional functionalities as will beappreciated by the skilled artisan.

Enzyme of the Present Invention

In one embodiment of the present invention, the a polypeptide of thepresent invention may be added to a detergent composition in an amountcorresponding to 0.001-100 mg of protein, such as 0.01-100 mg ofprotein, preferably 0.005-50 mg of protein, more preferably 0.01-25 mgof protein, even more preferably 0.05-10 mg of protein, most preferably0.05-5 mg of protein, and even most preferably 0.01-1 mg of protein perliter of wash liquor.

The enzyme(s) of the detergent composition of the invention may bestabilized using conventional stabilizing agents, e.g., a polyol such aspropylene glycol or glycerol, a sugar or sugar alcohol, lactic acid,boric acid, or a boric acid derivative, e.g., an aromatic borate ester,or a phenyl boronic acid derivative such as 4-formylphenyl boronic acid,and the composition may be formulated as described in, for example,WO92/19709 and WO92/19708.

A polypeptide of the present invention may also be incorporated in thedetergent formulations disclosed in WO97/07202, which is herebyincorporated by reference.

Surfactants

The detergent composition may comprise one or more surfactants, whichmay be anionic and/or cationic and/or non-ionic and/or semi-polar and/orzwitterionic, or a mixture thereof. In a particular embodiment, thedetergent composition includes a mixture of one or more nonionicsurfactants and one or more anionic surfactants. The surfactant(s) istypically present at a level of from about 0.1% to 60% by weight, suchas about 1% to about 40%, or about 3% to about 20%, or about 3% to about10%. The surfactant(s) is chosen based on the desired cleaningapplication, and includes any conventional surfactant(s) known in theart. Any surfactant known in the art for use in detergents may beutilized.

When included therein the detergent will usually contain from about 1%to about 40% by weight, such as from about 5% to about 30%, includingfrom about 5% to about 15%, or from about 20% to about 25% of an anionicsurfactant. Non-limiting examples of anionic surfactants includesulfates and sulfonates, in particular, linear alkylbenzenesulfonates(LAS), isomers of LAS, branched alkylbenzenesulfonates (BABS),phenylalkanesulfonates, alpha-olefinsulfonates (AOS), olefin sulfonates,alkene sulfonates, alkane-2,3-diylbis(sulfates), hydroxyalkanesulfonatesand disulfonates, alkyl sulfates (AS) such as sodium dodecyl sulfate(SDS), fatty alcohol sulfates (FAS), primary alcohol sulfates (PAS),alcohol ethersulfates (AES or AEOS or FES, also known as alcoholethoxysulfates or fatty alcohol ether sulfates), secondaryalkanesulfonates (SAS), paraffin sulfonates (PS), ester sulfonates,sulfonated fatty acid glycerol esters, alpha-sulfo fatty acid methylesters (alpha-SFMe or SES) including methyl ester sulfonate (MES),alkyl- or alkenylsuccinic acid, dodecenyl/tetradecenyl succinic acid(DTSA), fatty acid derivatives of amino acids, diesters and monoestersof sulfo-succinic acid or soap, and combinations thereof.

When included therein the detergent will usually contain from about 1%to about 40% by weight of a cationic surfactant. Non-limiting examplesof cationic surfactants include alkyldimethylehanolamine quat (ADMEAQ),cetyltrimethylammonium bromide (CTAB), dimethyldistearylammoniumchloride (DSDMAC), and alkylbenzyldimethylammonium, and combinationsthereof, Alkyl quaternary ammonium compounds, Alkoxylated quaternaryammonium (AQA),

When included therein the detergent will usually contain from about 0.2%to about 40% by weight of a non-ionic surfactant, for example from about0.5% to about 30%, in particular from about 1% to about 20%, from about3% to about 10%, such as from about 3% to about 5%, or from about 8% toabout 12%. Non-limiting examples of non-ionic surfactants includealcohol ethoxylates (AE or AEO), alcohol propoxylates, propoxylatedfatty alcohols (PFA), alkoxylated fatty acid alkyl esters, such asethoxylated and/or propoxylated fatty acid alkyl esters, alkylphenolethoxylates (APE), nonylphenol ethoxylates (NPE), alkylpolyglycosides(APG), alkoxylated amines, fatty acid monoethanolamides (FAM), fattyacid diethanolamides (FADA), ethoxylated fatty acid monoethanolamides(EFAM), propoxylated fatty acid monoethanolamide (PFAM), polyhydroxyalkyl fatty acid amides, or N-acyl N-alkyl derivatives of glucosamine(glucamides, GA, or fatty acid glucamide, FAGA), as well as productsavailable under the trade names SPAN and TWEEN, and combinationsthereof.

When included therein the detergent will usually contain from about 1%to about 40% by weight of a semipolar surfactant. Non-limiting examplesof semipolar surfactants include amine oxides (AO) such asalkyldimethylamineoxide, N-(coco alkyl)-N,N-dimethylamine oxide andN-(tallow-alkyl)-N,N-bis(2-hydroxyethyl)amine oxide, fatty acidalkanolamides and ethoxylated fatty acid alkanolamides, and combinationsthereof.

When included therein the detergent will usually contain from about 1%to about 40% by weight of a zwitterionic surfactant. Non-limitingexamples of zwitterionic surfactants include betaine,alkyldimethylbetaine, and sulfobetaine, and combinations thereof.

Hydrotropes

A hydrotrope is a compound that solubilises hydrophobic compounds inaqueous solutions (or oppositely, polar substances in a non-polarenvironment). Typically, hydrotropes have both hydrophilic and ahydrophobic character (so-called amphiphilic properties as known fromsurfactants); however the molecular structure of hydrotropes generallydo not favor spontaneous self-aggregation, see e.g. review by Hodgdonand Kaler (2007), Current Opinion in Colloid & Interface Science 12:121-128. Hydrotropes do not display a critical concentration above whichself-aggregation occurs as found for surfactants and lipids formingmiceller, lamellar or other well defined meso-phases. Instead, manyhydrotropes show a continuous-type aggregation process where the sizesof aggregates grow as concentration increases. However, many hydrotropesalter the phase behavior, stability, and colloidal properties of systemscontaining substances of polar and non-polar character, includingmixtures of water, oil, surfactants, and polymers. Hydrotropes areclassically used across industries from pharma, personal care, food, totechnical applications. Use of hydrotropes in detergent compositionsallow for example more concentrated formulations of surfactants (as inthe process of compacting liquid detergents by removing water) withoutinducing undesired phenomena such as phase separation or high viscosity.

The detergent may contain 0-5% by weight, such as about 0.5 to about 5%,or about 3% to about 5%, of a hydrotrope. Any hydrotrope known in theart for use in detergents may be utilized. Non-limiting examples ofhydrotropes include sodium benzene sulfonate, sodium p-toluenesulfonates (STS), sodium xylene sulfonates (SXS), sodium cumenesulfonates (SCS), sodium cymene sulfonate, amine oxides, alcohols andpolyglycolethers, sodium hydroxynaphthoate, sodium hydroxynaphthalenesulfonate, sodium ethylhexyl sulfate, and combinations thereof.

Builders and Co-Builders

The detergent composition may contain about 0-65% by weight, such asabout 5% to about 50% of a detergent builder or co-builder, or a mixturethereof. In a dish wash detergent, the level of builder is typically40-65%, particularly 50-65%. The builder and/or co-builder mayparticularly be a chelating agent that forms water-soluble complexeswith Ca and Mg. Any builder and/or co-builder known in the art for usein laundry detergents may be utilized. Non-limiting examples of buildersinclude zeolites, diphosphates (pyrophosphates), triphosphates such assodium triphosphate (STP or STPP), carbonates such as sodium carbonate,soluble silicates such as sodium metasilicate, layered silicates (e.g.,SKS-6 from Hoechst), ethanolamines such as 2-aminoethan-1-ol (MEA),iminodiethanol (DEA) and 2,2′,2″-nitrilotriethanol (TEA), andcarboxymethylinulin (CMI), and combinations thereof.

The detergent composition may also contain 0-50% by weight, such asabout 5% to about 40%, of a detergent co-builder, or a mixture thereof.The detergent composition may include a co-builder alone, or incombination with a builder, for example a zeolite builder. Non-limitingexamples of co-builders include homopolymers of polyacrylates orcopolymers thereof, such as poly(acrylic acid) (PAA) or copoly(acrylicacid/maleic acid) (PAA/PMA). Further non-limiting examples includecitrate, chelators such as aminocarboxylates, aminopolycarboxylates andphosphonates, and alkyl- or alkenylsuccinic acid. Additional specificexamples include 2,2′,2″-nitrilotriacetic acid (NTA),etheylenediaminetetraacetic acid (EDTA), diethylenetriaminepentaaceticacid (DTPA), iminodisuccinic acid (IDS), ethylenediamine-N,N′-disuccinicacid (EDDS), methylglycinediacetic acid (MGDA), glutamicacid-N,N-diacetic acid (GLDA), 1-hydroxyethane-1,1-diylbis(phosphonicacid) (HEDP), ethylenediaminetetrakis(methylene)tetrakis(phosphonicacid) (EDTMPA), diethylenetriaminepentakis(methylene)pentakis(phosphonicacid) (DTPMPA), N-(2-hydroxyethyl)iminodiacetic acid (EDG), asparticacid-N-monoacetic acid (ASMA), aspartic acid-N,N-diacetic acid (ASDA),aspartic acid-N-monopropionic acid (ASMP), iminodisuccinic acid (IDA),N-(2-sulfomethyl) aspartic acid (SMAS), N-(2-sulfoethyl) aspartic acid(SEAS), N-(2-sulfomethyl) glutamic acid (SMGL), N-(2-sulfoethyl)glutamic acid (SEGL), N-methyliminodiacetic acid (MIDA),α-alanine-N,N-diacetic acid (α-ALDA), serine-N,N-diacetic acid (SEDA),isoserine-N,N-diacetic acid (ISDA), phenylalanine-N,N-diacetic acid(PHDA), anthranilic acid-N,N-diacetic acid (ANDA), sulfanilicacid-N,N-diacetic acid (SLDA), taurine-N,N-diacetic acid (TUDA) andsulfomethyl-N,N-diacetic acid (SMDA),N-(hydroxyethyl)-ethylidenediaminetriacetate (HEDTA), diethanolglycine(DEG), Diethylenetriamine Penta (Methylene Phosphonic acid) (DTPMP),aminotris(methylenephosphonic acid) (ATMP), and combinations and saltsthereof. Further exemplary builders and/or co-builders are described in,e.g., WO 09/102,854, U.S. Pat. No. 5,977,053

Bleaching Systems

The detergent may comprise a bleaching system. Any bleaching systemknown in the art for use in laundry detergents may be utilized. Suitablebleaching system components include bleaching catalysts, photobleaches,bleach activators, sources of hydrogen peroxide such as sodiumpercarbonate and sodium perborates, preformed peracids and mixturesthereof. Suitable preformed peracids include, but are not limited to,peroxycarboxylic acids and salts, percarbonic acids and salts, perimidicacids and salts, peroxymonosulfuric acids and salts, for example, Oxone(R), and mixtures thereof. Non-limiting examples of bleaching systemsinclude peroxide-based bleaching systems, which may comprise, forexample, an inorganic salt, including alkali metal salts such as sodiumsalts of perborate (usually mono- or tetra-hydrate), percarbonate,persulfate, perphosphate, persilicate salts, in combination with aperacid-forming bleach activator. By Bleach activator is meant herin acompound which reacts with peroxygen bleach like hydrogen peroxide toform a Peracid. The peracid thus formed constitutes the activatedbleach. Suitable bleach activators to be used herin include thosebelonging to the class of esters amides, imides or anhydrides, Suitableexamples are tetracetyl athylene diamine (TAED), sodium 3,5,5 trimethylhexanoyloxybenzene sulphonat, diperoxy dodecanoic acid,4-(dodecanoyloxy)benzenesulfonate (LOBS),4-(decanoyloxy)benzenesulfonate, 4-(decanoyloxy)benzoate (DOBS),4-(3,5,5-trimethylhexanoyloxy)benzenesulfonate (ISONOBS),tetraacetylethylenediamine (TAED) and 4-(nonanoyloxy)benzenesulfonate(NOBS), and/or those disclosed in WO98/17767. A particular family ofbleach activators of interest was disclosed in EP624154 and particularypreferred in that family is acetyl triethyl citrate (ATC). ATC or ashort chain triglyceride like Triacin has the advantage that it isenvironmental friendly as it eventually degrades into citric acid andalcohol. Furthermore acethyl triethyl citrate and triacetin has a goodhydrolytical stability in the product upon storage and it is anefficient bleach activator. Finally ATC provides a good buildingcapacity to the laundry additive. Alternatively, the bleaching systemmay comprise peroxyacids of, for example, the amide, imide, or sulfonetype. The bleaching system may also comprise peracids such as6-(phthaloylamino)percapronic acid (PAP). The bleaching system may alsoinclude a bleach catalyst. In some embodiments the bleach component maybe an organic catalyst selected from the group consisting of organiccatalysts having the following formulae:

(iii) and mixtures thereof; wherein each R¹ is independently a branchedalkyl group containing from 9 to 24 carbons or linear alkyl groupcontaining from 11 to 24 carbons, preferably each R¹ is independently abranched alkyl group containing from 9 to 18 carbons or linear alkylgroup containing from 11 to 18 carbons, more preferably each R¹ isindependently selected from the group consisting of 2-propylheptyl,2-butyloctyl, 2-pentylnonyl, 2-hexyldecyl, n-dodecyl, n-tetradecyl,n-hexadecyl, n-octadecyl, iso-nonyl, iso-decyl, iso-tridecyl andiso-pentadecyl. Other exemplary bleaching systems are described, e.g.,in WO2007/087258, WO2007/087244, WO2007/087259, WO2007/087242. Suitablephotobleaches may for example be sulfonated zinc phthalocyanine

Polymers

The detergent may contain 0-10% by weight, such as 0.5-5%, 2-5%, 0.5-2%or 0.2-1% of a polymer. Any polymer known in the art for use indetergents may be utilized. The polymer may function as a co-builder asmentioned above, or may provide antiredeposition, fiber protection, soilrelease, dye transfer inhibition, grease cleaning and/or anti-foamingproperties. Some polymers may have more than one of the above-mentionedproperties and/or more than one of the below-mentioned motifs. Exemplarypolymers include (carboxymethyl)cellulose (CMC), poly(vinyl alcohol)(PVA), poly(vinylpyrrolidone) (PVP), poly(ethyleneglycol) orpoly(ethylene oxide) (PEG), ethoxylated poly(ethyleneimine),carboxymethyl inulin (CMI), and polycarboxylates such as PAA, PAA/PMA,poly-aspartic acid, and lauryl methacrylate/acrylic acid copolymers,hydrophobically modified CMC(HM-CMC) and silicones, copolymers ofterephthalic acid and oligomeric glycols, copolymers of polyethyleneterephthalate and polyoxyethene terephthalate (PET-POET), PVP,poly(vinylimidazole) (PVI), poly(vinylpyridin-N-oxide) (PVPO or PVPNO)and polyvinylpyrrolidone-vinylimidazole (PVPVI). Further exemplarypolymers include sulfonated polycarboxylates, polyethylene oxide andpolypropylene oxide (PEO-PPO) and diquaternium ethoxy sulfate. Otherexemplary polymers are disclosed in, e.g., WO 2006/130575. Salts of theabove-mentioned polymers are also contemplated.

Fabric Hueing Agents

The detergent compositions of the present invention may also includefabric hueing agents such as dyes or pigments which when formulated indetergent compositions can deposit onto a fabric when said fabric iscontacted with a wash liquor comprising said detergent compositions thusaltering the tint of said fabric through absorption/reflection ofvisible light. Fluorescent whitening agents emit at least some visiblelight. In contrast, fabric hueing agents alter the tint of a surface asthey absorb at least a portion of the visible light spectrum. Suitablefabric hueing agents include dyes and dye-clay conjugates, and may alsoinclude pigments. Suitable dyes include small molecule dyes andpolymeric dyes. Suitable small molecule dyes include small molecule dyesselected from the group consisting of dyes falling into the Colour Index(C.I.) classifications of Direct Blue, Direct Red, Direct Violet, AcidBlue, Acid Red, Acid Violet, Basic Blue, Basic Violet and Basic Red, ormixtures thereof, for example as described in WO2005/03274,WO2005/03275, WO2005/03276 and EP1876226 (hereby incorporated byreference). The detergent composition preferably comprises from about0.00003 wt % to about 0.2 wt %, from about 0.00008 wt % to about 0.05 wt%, or even from about 0.0001 wt % to about 0.04 wt % fabric hueingagent. The composition may comprise from 0.0001 wt % to 0.2 wt % fabrichueing agent, this may be especially preferred when the composition isin the form of a unit dose pouch. Suitable hueing agents are alsodisclosed in, e.g., WO 2007/087257, WO2007/087243.

(Additional) Enzymes

The detergent additive as well as the detergent composition may compriseone or more additional enzymes such as a protease, lipase, cutinase, anamylase, carbohydrase, cellulase, pectinase, mannanase, arabinase,galactanase, xylanase, oxidase, e.g., a laccase, and/or peroxidase.

In general the properties of the selected enzyme(s) should be compatiblewith the selected detergent, (i.e., pH-optimum, compatibility with otherenzymatic and non-enzymatic ingredients, etc.), and the enzyme(s) shouldbe present in effective amounts.

Cellulases:

Suitable cellulases include those of bacterial or fungal origin.Chemically modified or protein engineered mutants are included. Suitablecellulases include cellulases from the genera Bacillus, Pseudomonas,Humicola, Fusarium, Thielavia, Acremonium, e.g., the fungal cellulasesproduced from Humicola insolens, Myceliophthora thermophila and Fusariumoxysporum disclosed in U.S. Pat. No. 4,435,307, U.S. Pat. No. 5,648,263,U.S. Pat. No. 5,691,178, U.S. Pat. No. 5,776,757 and WO 89/09259.

Especially suitable cellulases are the alkaline or neutral cellulaseshaving color care benefits. Examples of such cellulases are cellulasesdescribed in EP 0 495 257, EP 0 531 372, WO 96/11262, WO 96/29397, WO98/08940. Other examples are cellulase variants such as those describedin WO 94/07998, EP 0 531 315, U.S. Pat. No. 5,457,046, U.S. Pat. No.5,686,593, U.S. Pat. No. 5,763,254, WO 95/24471, WO 98/12307 andPCT/DK98/00299.

Commercially available cellulases include Celluzyme™, and Carezyme™(Novozymes A/S), Clazinase™, and Puradax HA™ (Genencor InternationalInc.), and KAC-500(B)™ (Kao Corporation).

Proteases:

Suitable proteases include those of animal, vegetable or microbialorigin. Microbial origin is preferred. Chemically modified or proteinengineered mutants are included. The protease may be a serine proteaseor a metalloprotease, preferably an alkaline microbial protease or atrypsin-like protease. Examples of alkaline proteases are subtilisins,especially those derived from Bacillus, e.g., subtilisin Novo,subtilisin Carlsberg, subtilisin 309, subtilisin 147 and subtilisin 168(described in WO 89/06279). Examples of trypsin-like proteases aretrypsin (e.g., of porcine or bovine origin) and the Fusarium proteasedescribed in WO 89/06270 and WO 94/25583.

Examples of useful proteases are the variants described in WO 92/19729,WO 98/20115, WO 98/20116, and WO 98/34946, especially the variants withsubstitutions in one or more of the following positions: 27, 36, 57, 76,87, 97, 101, 104, 120, 123, 167, 170, 194, 206, 218, 222, 224, 235, and274.

Preferred commercially available protease enzymes include Alcalase™,Savinase™ Primase™, Duralase™, Esperase™, and Kannase™ (Novozymes NS),Maxatase™, Maxacal™Maxapem™, Properase™, Purafect™, Purafect OxP™, FN2™,and FN3™ (Genencor International Inc.).

Lipases and Cutinases:

Suitable lipases and cutinases include those of bacterial or fungalorigin. Chemically modified or protein engineered mutants are included.Examples include lipase from Thermomyces, e.g., from T. lanuginosus(previously named Humicola lanuginosa) as described in EP 258 068 and EP305 216, cutinase from Humicola, e.g. H. insolens as described in WO96/13580, a Pseudomonas lipase, e.g., from P. alcaligenes or P.pseudoalcaligenes (EP 218 272), P. cepacia (EP 331 376), P. stutzeri (GB1,372,034), P. fluorescens, Pseudomonas sp. strain SD 705 (WO 95/06720and WO 96/27002), P. wisconsinensis (WO 96/12012), a Bacillus lipase,e.g., from B. subtilis (Dartois et al., 1993, Biochemica et BiophysicaActa, 1131: 253-360), B. stearothermophilus (JP 64/744992) or B. pumilus(WO 91/16422).

Other examples are lipase variants such as those described in WO92/05249, WO 94/01541, EP 407 225, EP 260 105, WO 95/35381, WO 96/00292,WO 95/30744, WO 94/25578, WO 95/14783, WO 95/22615, WO 97/04079, WO97/07202, WO 00/060063, WO2007/087508 and WO 2009/109500.

Preferred commercially available lipase enzymes include Lipolase™,Lipolase Ultra™, and Lipex™; Lecitase™, Lipolex™; Lipoclean™, Lipoprime™(Novozymes A/S). Other commercially available lipases include Lumafast(Genencor Int Inc); Lipomax (Gist-Brocades/Genencor Int Inc) andBacillus sp lipase from Solvay.

Amylases:

Suitable amylases (α and/or β) include those of bacterial or fungalorigin. Chemically modified or protein engineered mutants are included.Amylases include, for example, α-amylases obtained from Bacillus, e.g.,a special strain of Bacillus licheniformis, described in more detail inGB 1,296,839.

Examples of useful amylases are the variants described in WO 94/02597,WO 94/18314, WO 96/23873, and WO 97/43424, especially the variants withsubstitutions in one or more of the following positions: 15, 23, 105,106, 124, 128, 133, 154, 156, 181, 188, 190, 197, 202, 208, 209, 243,264, 304, 305, 391, 408, and 444.

Commercially available amylases are Stainzyme™, Natelase™, Duramyl™,Termamyl™, Fungamyl™ and BAN™ (Novozymes NS), Rapidase™ and Purastar™(from Genencor International Inc.).

Peroxidases/Oxidases:

Suitable peroxidases/oxidases include those of plant, bacterial orfungal origin. Chemically modified or protein engineered mutants areincluded. Examples of useful peroxidases include peroxidases fromCoprinus, e.g., from C. cinereus, and variants thereof as thosedescribed in WO 93/24618, WO 95/10602, and WO 98/15257.

Commercially available peroxidases include Guardzyme™ (Novozymes NS).

The detergent enzyme(s) may be included in a detergent composition byadding separate additives containing one or more enzymes, or by adding acombined additive comprising all of these enzymes. A detergent additiveof the invention, i.e., a separate additive or a combined additive, canbe formulated, for example, as a granulate, liquid, slurry, etc.Preferred detergent additive formulations are granulates, in particularnon-dusting granulates, liquids, in particular stabilized liquids, orslurries.

Non-dusting granulates may be produced, e.g., as disclosed in U.S. Pat.Nos. 4,106,991 and 4,661,452 and may optionally be coated by methodsknown in the art. Examples of waxy coating materials are poly(ethyleneoxide) products (polyethyleneglycol, PEG) with mean molar weights of1000 to 20000; ethoxylated nonylphenols having from 16 to 50 ethyleneoxide units; ethoxylated fatty alcohols in which the alcohol containsfrom 12 to 20 carbon atoms and in which there are 15 to 80 ethyleneoxide units; fatty alcohols; fatty acids; and mono- and di- andtriglycerides of fatty acids. Examples of film-forming coating materialssuitable for application by fluid bed techniques are given in GB1483591. Liquid enzyme preparations may, for instance, be stabilized byadding a polyol such as propylene glycol, a sugar or sugar alcohol,lactic acid or boric acid according to established methods. Protectedenzymes may be prepared according to the method disclosed in EP 238,216.

Adjunct Materials

Any detergent components known in the art for use in laundry detergentsmay also be utilized. Other optional detergent components includeanti-corrosion agents, anti-shrink agents, anti-soil redepositionagents, anti-wrinkling agents, bactericides, binders, corrosioninhibitors, disintegrants/disintegration agents, dyes, enzymestabilizers (including boric acid, borates, CMC, and/or polyols such aspropylene glycol), fabric conditioners including clays,fillers/processing aids, fluorescent whitening agents/opticalbrighteners, foam boosters, foam (suds) regulators, perfumes,soil-suspending agents, softeners, suds suppressors, tarnish inhibitors,and wicking agents, either alone or in combination. Any ingredient knownin the art for use in laundry detergents may be utilized. The choice ofsuch ingredients is well within the skill of the artisan.

Dispersants—

The detergent compositions of the present invention can also containdispersants. In particular powdered detergents may comprise dispersants.Suitable water-soluble organic materials include the homo- orco-polymeric acids or their salts, in which the polycarboxylic acidcomprises at least two carboxyl radicals separated from each other bynot more than two carbon atoms. Suitable dispersants are for exampledescribed in Powdered Detergents, Surfactant science series volume 71,Marcel Dekker, Inc.

Dye Transfer Inhibiting Agents—

The detergent compositions of the present invention may also include oneor more dye transfer inhibiting agents. Suitable polymeric dye transferinhibiting agents include, but are not limited to, polyvinylpyrrolidonepolymers, polyamine N-oxide polymers, copolymers of N-vinylpyrrolidoneand N-vinylimidazole, polyvinyloxazolidones and polyvinylimidazoles ormixtures thereof. When present in a subject composition, the dyetransfer inhibiting agents may be present at levels from about 0.0001%to about 10%, from about 0.01% to about 5% or even from about 0.1% toabout 3% by weight of the composition.

Fluorescent Whitening Agent—

The detergent compositions of the present invention will preferably alsocontain additional components that may tint articles being cleaned, suchas fluorescent whitening agent or optical brighteners. Where present thebrightener is preferably at a level of about 0.01% to about 0.5%. Anyfluorescent whitening agent suitable for use in a laundry detergentcomposition may be used in the composition of the present invention. Themost commonly used fluorescent whitening agents are those belonging tothe classes of diaminostilbene-sulphonic acid derivatives,diarylpyrazoline derivatives and bisphenyl-distyryl derivatives.Examples of the diaminostilbene-sulphonic acid derivative type offluorescent whitening agents include the sodium salts of:4,4′-bis-(2-diethanolamino-4-anilino-s-triazin-6-ylamino)stilbene-2,2′-disulphonate; 4,4′-bis-(2,4-dianilino-s-triazin-6-ylamino)stilbene-2,2′-disulphonate;4,4′-bis-(2-anilino-4(N-methyl-N-2-hydroxy-ethylamino)-s-triazin-6-ylamino)stilbene-2,2′-disulphonate,4,4′-bis-(4-phenyl-2,1,3-triazol-2-yl)stilbene-2,2′-disulphonate;4,4′-bis-(2-anilino-4(1-methyl-2-hydroxy-ethylamino)-s-triazin-6-ylamino)stilbene-2,2′-disulphonate and2-(stilbyl-4″-naptho-1.,2′:4,5)-1,2,3-trizole-2″-sulphonate. Preferredfluorescent whitening agents are Tinopal DMS and Tinopal CBS availablefrom Ciba-Geigy AG, Base1, Switzerland. Tinopal DMS is the disodium saltof 4,4′-bis-(2-morpholino-4 anilino-s-triazin-6-ylamino) stilbenedisulphonate. Tinopal CBS is the disodium salt of2,2′-bis-(phenyl-styryl) disulphonate. Also preferred are fluorescentwhitening agents is the commercially available Parawhite KX, supplied byParamount Minerals and Chemicals, Mumbai, India. Other fluorescerssuitable for use in the invention include the 1-3-diaryl pyrazolines andthe 7-alkylaminocoumarins.

Suitable fluorescent brightener levels include lower levels of fromabout 0.01, from 0.05, from about 0.1 or even from about 0.2 wt % toupper levels of 0.5 or even 0.75 wt %.

Soil Release Polymers—

The detergent compositions of the present invention may also include oneor more soil release polymers which aid the removal of soils fromfabrics such as cotton and polyester based fabrics, in particular theremoval of hydrophobic soils from polyester based fabrics. The soilrelease polymers may for example be nonionic or anionic terephthaltebased polymers, polyvinyl caprolactam and related copolymers, vinylgraft copolymers, polyester polyamides see for example Chapter 7 inPowdered Detergents, Surfactant science series volume 71, Marcel Dekker,Inc. Another type of soil release polymers are amphiphilic alkoxylatedgrease cleaning polymers comprising a core structure and a plurality ofalkoxylate groups attached to that core structure. The core structuremay comprise a polyalkylenimine structure or a polyalkanolaminestructure as described in detail in WO 2009/087523 (hereby incorporatedby reference). Furthermore random graft co-polymers are suitable soilrelease polymers Suitable graft co-polymers are described in more detailin WO 2007/138054, WO 2006/108856 and WO 2006/113314 (herebyincorporated by reference). Other soil release polymers are substitutedpolysaccharide structures especially substituted cellulosic structuressuch as modified cellulose deriviatives such as those described in EP1867808 or WO 2003/040279 (both are hereby incorporated by reference).Suitable cellulosic polymers include cellulose, cellulose ethers,cellulose esters, cellulose amides and mixtures thereof. Suitablecellulosic polymers include anionically modified cellulose, nonionicallymodified cellulose, cationically modified cellulose, zwitterionicallymodified cellulose, and mixtures thereof. Suitable cellulosic polymersinclude methyl cellulose, carboxy methyl cellulose, ethyl cellulose,hydroxylethyl cellulose, hydroxylpropyl methyl cellulose, ester carboxymethyl cellulose, and mixtures thereof.

Anti-Redeposition Agents—

The detergent compositions of the present invention may also include oneor more anti-redeposition agents such as carboxymethylcellulose (CMC),polyvinyl alcohol (PVA), polyvinylpyrrolidone (PVP), polyoxyethyleneand/or polyethyleneglycol (PEG), homopolymers of acrylic acid,copolymers of acrylic acid and maleic acid, and ethoxylatedpolyethyleneimines. The cellulose based polymers described under soilrelease polymers above may also function as anti-redeposition agents.

Other suitable adjunct materials include, but are not limited to,anti-shrink agents, anti-wrinkling agents, bactericides, binders,carriers, dyes, enzyme stabilizers, fabric softeners, fillers, foamregulators, hydrotropes, perfumes, pigments, sod suppressors, solvents,and structurants for liquid detergents and/or structure elasticizingagents.

Formulation of Detergent Products

The detergent composition of the invention may be in any convenientform, e.g., a bar, a homogenous tablet, a tablet having two or morelayers, a pouch having one or more compartments, a regular or compactpowder, a granule, a paste, a gel, or a regular, compact or concentratedliquid.

Detergent formulation forms: Layers (same or different phases), Pouches,versus forms for Machine dosing unit.

Pouches can be configured as single or multicompartments. It can be ofany form, shape and material which is suitable for hold the composition,e.g. without allowing the release of the composition to release of thecomposition from the pouch prior to water contact. The pouch is madefrom water soluble film which encloses an inner volume. Said innervolume can be devided into compartments of the pouch. Preferred filmsare polymeric materials preferably polymers which are formed into a filmor sheet. Preferred polymers, copolymers or derivates thereof areselected polyacrylates, and water soluble acrylate copolymers, methylcellulose, carboxy methyl cellulose, sodium dextrin, ethyl cellulose,hydroxyethyl cellulose, hydroxypropyl methyl cellulose, malto dextrin,poly methacrylates, most preferably polyvinyl alcohol copolymers and,hydroxyprpyl methyl cellulose (HPMC). Preferably the level of polymer inthe film for example PVA is at least about 60%. Preferred averagemolecular weight will typically be about 20,000 to about 150,000. Filmscan also be of blend compositions comprising hydrolytically degradableand water soluble polymer blends such as polyactide and polyvinylalcohol (known under the Trade reference M8630 as sold by Chris CraftIn. Prod. Of Gary, Ind., US) plus plasticisers like glycerol, ethyleneglycerol, Propylene glycol, sorbitol and mixtures thereof. The pouchescan comprise a solid laundry cleaning composition or part componentsand/or a liquid cleaning composition or part components separated by thewater soluble film. The compartment for liquid components can bedifferent in composition than compartments containing solids. Ref:(US2009/0011970 A1).

Detergent ingredients can be separated physically from each other bycompartments in water dissolvable pouches or in different layers oftablets. Thereby negative storage interaction between components can beavoided. Different dissolution profiles of each of the compartments canalso give rise to delayed dissolution of selected components in the washsolution.

Definition/Characteristics of the Forms

A liquid or gel detergent, which is not unit dosed, may be aqueous,typically containing at least 20% by weight and up to 95% water, such asup to about 70% water, up to about 65% water, up to about 55% water, upto about 45% water, up to about 35% water. Other types of liquids,including without limitation, alkanols, amines, diols, ethers andpolyols may be included in an aqueous liquid or gel. An aqueous liquidor gel detergent may contain from 0-30% organic solvent.

A liquid or gel detergent may be non-aqueous.

Granular Detergent Formulations

A granular detergent may be formulated as described in WO09/092,699,EP1705241, EP1382668, WO07/001,262, US6472364, WO04/074419 orWO09/102,854. Other useful detergent formulations are described inWO09/124,162, WO09/124,163, WO09/117,340, WO09/117,341, WO09/117,342,WO09/072,069, WO09/063,355, WO09/132,870, WO09/121,757, WO09/112,296,WO09/112,298, WO09/103,822, WO09/087,033, WO09/050,026, WO09/047,125,WO09/047,126, WO09/047,127, WO09/047,128, WO09/021,784, WO09/010,375,WO09/000,605, WO09/122,125, WO09/095,645, WO09/040,544, WO09/040,545,WO09/024,780, WO09/004,295, WO09/004,294, WO09/121,725, WO09/115,391,WO09/115,392, WO09/074,398, WO09/074,403, WO09/068,501, WO09/065,770,WO09/021,813, WO09/030,632, WO09/015,951, WO2011025615, WO2011016958,WO2011005803, WO2011005623, WO2011005730, WO2011005844, WO2011005904,WO2011005630, WO2011005830, WO2011005912, WO2011005905, WO2011005910,WO2011005813, WO2010135238, WO2010120863, WO2010108002, WO2010111365,WO2010108000, WO2010107635, WO2010090915, WO2010033976, WO2010033746,WO2010033747, WO2010033897, WO2010033979, WO2010030540, WO2010030541,WO2010030539, WO2010024467, WO2010024469, WO2010024470, WO2010025161,WO2010014395, WO2010044905, WO2010145887, WO2010142503, WO2010122051,WO2010102861, WO2010099997, WO2010084039, WO2010076292, WO2010069742,WO2010069718, WO2010069957, WO2010057784, WO2010054986, WO2010018043,WO2010003783, WO2010003792, WO2011023716, WO2010142539, WO2010118959,WO2010115813, WO2010105942, WO2010105961, WO2010105962, WO2010094356,WO2010084203, WO2010078979, WO2010072456, WO2010069905, WO2010076165,WO2010072603, WO2010066486, WO2010066631, WO2010066632, WO2010063689,WO2010060821, WO2010049187, WO2010031607, WO2010000636,

Method of Producing the Composition

The present invention also relates to methods of producing thecomposition.

Uses

-   -   The present invention is also directed to methods for using the        compositions thereof. Laundry/textile/fabric (House hold laundry        washing, Industrial laundry washing) Hard surface cleaning (ADW,        car wash, Industrial surface)

Use in detergents.

The polypeptides of the present invention may be added to and thusbecome a component of a detergent composition.

The detergent composition of the present invention may be formulated,for example, as a hand or machine laundry detergent compositionincluding a laundry additive composition suitable for pre-treatment ofstained fabrics and a rinse added fabric softener composition, or beformulated as a detergent composition for use in general household hardsurface cleaning operations, or be formulated for hand or machinedishwashing operations.

In a specific aspect, the present invention provides a detergentadditive comprising a polypeptide of the present invention as describedherein.

The gene may be obtained from any prokaryotic, eukaryotic, or othersource.

The present invention is further described by the following examplesthat should not be construed as limiting the scope of the invention.

EXAMPLES Strains

Aspergillus oryzae MT3568 strain was used for expression of the T.lapidaria gene encoding the polypeptides having cellulase activity. A.oryzae MT3568 is an amdS (acetamidase) disrupted gene derivative ofAspergillus oryzae JaL355 (WO 2002/40694) in which pyrG auxotrophy wasrestored by disrupting the A. oryzae acetamidase (amdS) gene with thepyrG gene.

Media and Solutions

LB plates were composed of 10 g of Bacto-Tryptone, 5 g of yeast extract,10 g of sodium chloride, 15 g of Bacto-agar, and deionized water to 1liter.

LB medium was composed of 10 g of Bacto-Tryptone, 5 g of yeast extract,and 10 g of sodium chloride, and deionized water to 1 liter.

YP medium was composed of 10 g of yeast extract, 20 g of Bactopeptone,and deionized water to 1 liter.

SOC medium: 20 g Bacto-Tryptone, 5 g Yeast extract and 0.5 g NaCl isdissolved in 1 liter of water. 10 ml 250 mM KCl is added and pH isadjusted to 7.0. The medium is autoclaved. After cooling to 60° C. 18 mlof sterile filtered 20% glucose and 5 ml of a 2M MgCl₂ solution isadded.

COVE Medium 20 ml salt solution 20 g agar 218 g sorbitol H₂O ad 1 lAutoclave and then add: 50 ml 20% glucose 10 mM NaNO₃. Salt solution: 26g KCl 26 g MgSO₄7H₂O 76 g KH₂PO₄ 50 ml trace element solution H₂O ad 1 lSucrose medium: 20 ml salt solution. 342 g sucrose. H₂O ad 1 l Autoclaveand then add: 10 mM NaNO₃ ST 0.6M sorbitol 100 mM Tris/HCl pH 7.0 STC1.2M sorbitol 10 mM CaCl₂ 10 mM Tris/HCl pH 7.5. PEG 60% (W/V) PEG 4000(BDH) in H₂O 10-20 sec. in microwave oven until it is dissolved, cooldown to RT then add 10 mM CaCl₂ 10 mM Tris/HCl pH 7.5. Topagar: 20 mlsalt solution 342 g sucrose 6 g Low melting agarose (BioWhittakerMolecular Applications 1-800-341-1574) It is important to adjust the pHto 6, with a few drops of 4M NaOH H₂O ad 1 L Autoclave Before use, boiland cold down to approx . . . 40° Amds selection: add 10 mM Acetamid og15 mM CsCl amdS selective plates 20 g agar 20 ml Saltsolution 342 gsucrose pH is adjusted to 6, with a few drops of 4M NaOH H₂O ad 1 lAutoclave 10 ml 1M acetamide 15 ml 1M CsCl

Methods Temperature Optimum

50 μL sample in appropriate dilution was mixed with 750 μLAZCL-HE-cellulose in 0.1M Na-phosphate. The samples were incubated attemperature covering 20 to 80° C. in thermo-mixers (1400 rpm agitation,20 min). After the incubation, unhydrolyzed substrate was removed bycentrifugation and 200 μL supernatant from each sample was transferredto a 96-hole microtitre plate. Absorbance at 600 nm was determined usinga PowerWave microtitre plate reader. Activity was expressed asabsorbance units.

Specific Activity Analysis Specific Activity on PASO (AmorphousCellulose; Phosphoric Acid Swollen Cellulose)

20 g PASO (amorphous cellulose; phosphoric acid swollen cellulose) (10%w/v) was centrifuged for 10 min, 5000 rpm. Supernatant was removed andthe pellet was resuspended in 40 mL 0.1M Na-phosphate buffer followed bya similar centrifugation step. The insolubles were resuspended in 20 mL0.1M Na-phosphate buffer. Substrate solution was preincubated at 40° C.for 5 min and 500 μL enzyme solution was added.

Enzyme solution/ Substrate (μL) Buffer (μL) glu standard (μL) 500 1500500

The sample was mixed and incubated for 20 min at 40° C. The reaction wasstopped by adding 500 μL 2% NaOH. The samples were centrifuged for 15min at 5000 rpm. 1000 μL supernatant was mixed with 500 μLPHBAH-solution. The samples were boiled for 10 min and cooled. A410 wasmeasured on all samples. The standard curve was done by replacing theenzyme solution with glucose with known concentrations (0-50 mg/L).Enzyme blanks were also added for each enzyme dilution (enzyme was addedafter the addition of 2% NaOH).

PHBAH Solution

1 g PHBAH in 100 mL 2% NaOH. 100 μL Bismuth solution was added.

Bismuth Solution

1 mol Bismuth nitrate

1 mol K/Na-tartrat

3 mol NaOH (solid)

100 mL H₂O

Specific activity on carboxymethyl-cellulose was determined using CMC7LF. A stock solution of 1% CMC was prepared. 1500 μL substrate wasmixed with 500 μL diluted enzyme. The samples were incubated for 20 minat 40° C. on temperate water batch. After 20 min, 1000 μL PHBAH/bismuth(1.5 g PHBAH+5 g K/Na tartrate+100 mL 2% NaOH) stop reagent was added.Samples were boiled for 10 min and were immediately cooled on ice. Theabsorbance was read at 410 nm. A standard curve was also mixed usingglucose (0 to 50 mg/L).

Wash Performance

Wash performance is testes as anti-redeposition test in mini-TOM withdirty detergent (5 ppm carbon black in ½ dose Liquid Persil small &mighty non-Bio) at 24 FH and 25° C.

The trials are run in 250 ml beakers. 8 pieces of pre-washed wfk 80A(pre-wash procedure see below), 5×5 cm are loaded in the beakerstogether with flotte: 100 ml, 1.25 mL/L detergent, 24 FH (13.45° dH)(Ca2+/Mg2+=2:1, no HCO₃), pH as in the detergent, 5 ppm carbon black andthe enzyme sample. The beakers are shaken, 125-150 rpm at a temperatureof 25° C. for three hours. Swatches are rinsed in running tap water.

There are 8 swatches. After 1 h and 2 h a swatch is removed from theload. I.e. 6 swatches are included in 3 h.

Carbon: A stock solution of carbon black is made by dispersing 100 mgcarbon black in 50 mL flotte providing a concentration of 2 mg per mL.The flotte is to contain 5 ppm carbon; i.e. 2.5 ml stock solution perlitre flotte is added. The mixture is stirred for half an hour beforeaddition to the detergent solution as well as during the flotte duringportioning.

Test/Analysis

Macbeth Color-Eye measurement of swatches at 500 nm after 1 h, 2 h and 3h incubation in dirty detergent. In all cases measurement is donethrough 2-6 layers of fabric. Each swatch is measured 2 times, 1 time onfront and 1 time on bag site, with aperture large and observation D65daylight.

Pre-Wash wfk 80A

Desizing: Washed in Miele 60° C. cotton standard wash program in tapwater, with 4 g/L IEC A* without CMC and Enzyme and 1.2 mg ep/L (0.09g/L or 2.2%) Stainzyme (Novozymes NS, Bagsvaerd Denmark).

Prewash: Washed in Miele 95° C. cotton standard wash program with extrarinse in tap water, with 4 g/L IEC A* without CMC and Enzyme

Rinse: Rinsed in Miele standard rinse program in deionized water.

Example 1 Isolation of the Terebella GH9 Cellulase

In a DNA sequencing project based on the marine organisms the marinepolychaete Terebella lapidaria was collected from Ria de Formosa, TheAlgarve, Southern Portugal. Nucleic acid material was isolated and twochannels of Terebella RNA was sequenced with RNA-sep technology in anillumine HTS sequences in accordance with the manufacturersinstructions.

A GH9 hydrolase was identified in the project having the nucleotidesequence disclosed in SEQ ID NO: 1. The gene encodes a polypeptide of428 amino acid having the structure of a GH9 cellulase with the aminoacid sequence shown in SEQ ID NO: 2.

Example 2 Expression of the Terebella GH9 Cellulase Preparation ofExpression Construct

The coding sequence for the Terebella cellulose was PCR cloned using theprimers

TLGH9F (SEQ ID NO: 3) GTGAAGCGTACGCGTTACGACTACGCCGATGCGCTGG TLGH9R(SEQ ID NO: 4) AGATCTCGAGAAGCTTATCACAAGCCCAGACTGAGGAGACC

The primers contain MluI and HindIII restriction sites for integrationof the isolated fragment into the selected expression vector. Theunderlined region of TLGH9F represents the MluI site used for the inframe fusion between the Candida B lipase signal-pre pro region and thepredicted mature peptide of the GH9.

cDNA fragments generated by Marathon cDNA Amplification kit fromClontech (cat. Nr 634913) according to the manufacturer's instructions.The resulting cDNA fragments were used as template. The primers abovewere used in a PCR reaction composed of 100 ng of Terebella lapidariacDNA, The cDNA was diluted in TE Buffer (10 mM Tris, 1 mM EDTA) pH 8.0to 100 ng/μl. A PTC-200 DNA engine (MJ Research Inc., Waltham, Mass.,USA) was used to perform the PCR reaction. Extensor Long PCR Master Mix,Buffer 1, ReddyMix™ version (AB Gene Ltd., Epsom United Kingdom) wasused for the PCR amplification. The master mix contains buffer, dNTPsand a thermostable polymerase blend.

The PCR reaction was composed of 20 μl of Extensor Long PCR Master Mix,Buffer 1, ReddyMix™ version, 1 μl of primer TLGH9F (100 μM), 1 μl ofprimer TLGH9R (100 μM), 1 μl of T. lapidaria cDNA (100 ng/μl), and 17 μlof deionized water in a total volume of 40 μl. The PCR conditions were 1cycle at 95° C. for 2 minutes. 30 cycles each at 94° C. for 10 seconds,55° C. for 30 seconds, and 68° C. for 1 minute; and 1 cycle at 68° C.for 10 minutes. The sample was then held at 10° C. until removed fromthe PCR machine.

The reaction product was isolated by 1.0% agarose gel electrophoresisusing 40 mM Tris base, 20 mM sodium acetate, 1 mM disodium EDTA (TAE)buffer where a 1.3 kb product band was excised from the gel and purifiedusing an illustra GFX® PCR DNA and Gel Band Purification Kit (GEHealthcare Life Sciences, Brondby, Denmark) according to themanufacturer's instructions. The fragment was then cloned into Mlel andHindIII digested pDau222 using an IN-FUSION™ Cloning Kit resulting inplasmid pGH9-7-2.

The vector used was pDau222, an Aspergillus expression plasmid based onpDau109 (WO 2005/042735). To create pDau222 from pDau109, the followingBamHI-HindIII flanked insert was used to replace the BamHI-HindIIIinsert of pDau222:

(SEQ ID NO: 5) GGATCCACCATGAAGCTACTCTCTCTGACCGGTGTGGCTGGTGTGCTTGCGACTTGCGTTGCAGCCACTCCTTTGGTGAAGCGTACGCGTGCGCATCACCATCACCATCACTAAGCTT.

The underlined portions denote the BamHI, Mlul and HindIII restrictionsites respectively. The codons representing the KexB cleavage site areshown in bold. The insert enables the fusion of a signal less codingregion of interest with the Candida anartica lipase B (CaIB) secretionsignal and pre pro region via utilization of the MluI cloning site. TheATG start codon of the CaIB region is optimally placed for expressionfrom the NA2-tpi promoter. Placement of the coding region of interestbetween the Mlul site and the HindIII site can result in a secretedprotein that is further cleaved by the KexB processing protease to amature protein. A translation of the insert from the ATG start codon isas follows:

(SEQ ID NO: 6) MKLLSLTGVAGVLATCVAATPLVKRTRAHHHHHH

The cleavable region of the CaIB signal is underlined and the KexBprotease cleavage site is shown in bold. The poly histidines displayedare available for His-Tag C terminal fusions or can be omitted from theexpression by presence of a stop codon to the coding region of interest.

The cloning of the pGH9-7-2 gene into Mlel HindIII digested pDau222resulted in the transcription of the Terebella lapidaria open readingframe under the under the control of a NA2-tpi double promoter. NA2-tpiis a modified promoter from the gene encoding the Aspergillus nigerneutral alpha-amylase in which the untranslated leader has been replacedby an untranslated leader from the gene encoding the Aspergillusnidulans triose phosphate isomerase.

For cloning the isolated PCR fragment and 300 ng pDau222 precut withMlul and HindIII were mixed in a 2:1 molar ratio in 10 μl deionizedwater. The aluminium seal was removed from a In-Fusion reaction tube kitand 10 μl of the mixture containing fragment and vector was added andcarefully mixed. The mixture was incubated at room temperature (appx 25°C.) for 30 minutes and thereafter the tube was transferred to ice. TheIn-fusion reaction mixture was diluted with 40 μl TE buffer (10 mMTris-HCl, 1 mM EDTA pH 8.0) and stored at −20° C. until use.

The diluted reaction mixture was transformed into competent Fusion BlueCompetent cells (Clontech Inc.) using the instructions of themanufacturer. 4 μl of the diluted reaction mixtire was gently mixed with50 μl of the thawed competent cells and the mixture incubated 30 minuteson ice. The cells are heat shocked in a water bath at 42° C. for 45 sand thereafter returned to ice for 1 minute. After heat showing 250 μlSOC medium was added to the cells and the mixture was incubated at 37°C. for 90 minutes with shaking at 220 rpm. 20 μl of the cells werespread on a LB+50 μg/ml ampicillin, the remaining cells were spread onanother plate and the plates were incubated over night at 37° C.

Next day several colonies appeared on the LB plates. 10 random colonieswere picked and grown over night in 3 ml LB+100 μg/ml ampicillin,plasmid DNA was prepared from these colonies using the “FASTPlasmid minikit” from 5Prime according to the manufacturers instructions. Theisolated plasmids were sequenced in order to confirm the sequence (datanot shown). One clone with correct sequence was selected and incubatedover night in 50 ml LB+50 μg/ml ampicillin at 37° C. at 225 rpm. PlasmidDNA was prepared from the culture using a Qiagen Compact Prep midi kit,cat.nr: 12743. Protocol according to manufacturer.

Transformation of Aspergillus Expression Host

2.5 μg of the prepared expression construct was transformed intoAspergillus oryzae MT3568 using the following protocol:

-   -   An agar slant (COVE medium) is inoculated with spores of the        strain in question (MT3568), and the strain is grown at 37° C.        for approx 6 days (until it is completely sporulated). Slants        can be stored in the cold for approx 2 month.    -   5-10 ml YP medium is added to the slant, and the spores are        suspended. The spore suspension is transferred to a 500 ml        Nalgene plastic shakeflask, containing 100 ml sucrose medium.        The flask is incubated at 30° C. for 20 hr (270 rpm).    -   The mycelium is collected by filtration through mira cloth        (Calbiochem Cat. No 475855), and it is washed using 200 ml 0.6 M        MgSO₄. The remaining liquid is squeezed out of the mycelium        using a plastic pipette.    -   1-2 g of the mycelium is transferred to a 125 ml Nalgene plastic        erlenmeyer flask. 15 ml 1.2 M MgSO₄, 10 mM NaH₂PO₄ pH 5.8        containing 750 mg Glucanex® 200 G. (Novozymes Bagsvaerd,        Denmark) batch KM629002), is added, and the mycelium is        suspended. Put on ice for 5 minutes. 1 ml of 12 mg/ml BSA        (sterile filtered) is added.    -   The suspension is incubated at 37° C. for 1, 5-2 hours, and the        protoplasting is monitored frequently by microscopy.    -   The protoplast suspension is filtered through mira cloth into a        25 ml NUNC tube (or similar), and the suspension is overlaid        with 5 ml ST (be careful not to mix up the lower layer), and the        protoplasts are banded by centrifugation (2500 rpm, 15 min, slow        acc.). The interface band of protoplasts are recovered using a        P1000 pipette and transferred to a fresh tube.    -   The protoplasts are diluted with 2 volumes of STC followed by        centrifugation (2500 rpm, 5 min). The protoplasts are washed        twice with STC (using resuspension and centrifugation), and are        then resuspended in STC to a concentration of approx 5×10⁷        protoplasts/ml. In most case resupending in 4-5 ml of STC will        give the correctly concentration.    -   For each transformation, DNA is added at to bottom in a tube,        and 100 μl of protoplasts are added. After 10 minutes of        incubation (RT), 250 μl of PEG is added, and the tube is gently        mixed by hand. After 30 minutes of incubation)(37°, 4 ml of        topagar (temp 40°) is added. The protoplasts are plated onto        selective plates.    -   The plates are incubated at 37° C. until transformants are        clearly visible and start to sporulate.

16 Transformants were picked and grown in dep well microtiter platescontaining 750 ml/well of YP+2% glucose for 4 days at 26° C.Supernatants from the transformants were tested for cellulose activityon AZCL-He-cellulose in microtiter plates. 50 μl supernatant was mixedwith 200 μl AZCL-He-cellulose mixture (0.1% AZCL-He-Cellulose in 0.2MMOPS buffer, pH 7.0) and incubated at 26° C. over night. Thesupernatants from all transformants has activity in this assaydemonstrating that the Terebella GH9 cellulase has activity onAZCL-He-cellulose.

Example 3 Purification and Characterization of the CellulasePurification of the Cellulase

One transformant prepared in example 2, was grown in 1 liter YP+2%glucose medium 4 days at 26° C. in shake flasks at 225 rpm, and thesupernatant collected for purification of the cloned cellulose.

Culture broth was sterile filtered through 0.2 μm PES bottle-topfilters. 800 ml broth was concentrated using ultra filtration (Sartorium10 kDa PES membrane) resulting in 250 ml concentrate, which was mixedwith an equal volume of a 2M (NH₄)₂SO₄ solution in 40 mM Tris-HCl, pH8.0. The precipitate was removed by centrifugation and the fluid wasloaded on a 16 ml PhenylSepharose HP column. Unbound material wasremoved by washing with the binding buffer (1 M (NH₄)₂SO₄ solution in 20mM Tris-HCl, pH 8.0). Elution was done by a linear decreasing gradient(NH₄)₂SO₄ (1.0M to 0.0 M). Fractions of 10 ml were collected andanalysed for endo-glucanase activity on AZCL-HE-cellulose.

Activity analysis was carried out using 0.2% AZCL-HE-cellulose. Briefly,750 μL substrate slurry in 50 mM Na-Phosphate buffer, pH 7.5 wasincubated with 50 μL sample. The samples were incubated for 20 min on athermo mixer with full agitation (1400 rpm, 40° C.). Samples developingblue color were treated as endoglucanase positive. The endoglucansepositive fractions were analyzed for purity using SDS-PAGE. Fractionsevaluated as pure were pooled.

Protein concentration was determined by measuring A₂₈₀ and ε=127590M⁻¹cm⁻¹ and a theoretical molecular weight of 47601 g/mol. Theseparameters were calculated using the software GPMAW (Lighthousesoftware, DK).

The estimated purify was >95% pure based on visual determination from anSDS-PAGE. The apparent molecular weight was 50 kDa, a value in agreementwith the expected molecular weight.

Protein concentration of the purified preparation was estimated to 0.77mg/mL.

pH Optimum

The pH optimum was determined using amorphous cellulose (PASO) andreducing sugar quantification. Max activity was observed at pH 6 but theenzyme also displayed>50% activity from pH 5 to 9 (see FIG. 1 Error!Reference source not found.).

Specific Activity

Substrate Specific activity Conditions CMC 270 IU/mg Reducing ends, pH7.0, 40° C. PASO  11 IU/mg Reducing ends, pH 7.0, 40° C.

Thus, the activity on CMC was done by measuring the liberation ofreducing sugars using PHBAH as reagent. The analysis revealed a specificactivity of 270 IU/mg. Many of the endoglucanases described in2008-07497-20 have much lower specific activity on CMC. However, CMC isan artificial cellulose substrate. A more “real life” substrate isphosphoric acid swollen cellulose, which is believed to consist ofmainly amorphous cellulose. The observed specific activity on thissubstrate was 11 IU/mg. As a comparison, approximiately 80 IU/mg hasbeen reported for Humicola insolens cellulase (WO9117243).

Temperature Profile

The temperature profile is presented in FIG. 2. This enzyme did not showany signs of being a low temperature cellulase. On the contrary, thetemperature optimum was 60° C. and at 70° C. it displayed 70% of itsmaximum activity.

Example 4 Wash Performance

The wash performance of the Terebella cellulase was tested using was ananti-redeposition test, where white cotton fabric is washed in adetergent solution supplemented with carbon black. It has been shownpreviously that the addition of an endoglucanase can reduce the amountof carbon black sticking to the fabric in these rather stressed washconditions. The textiles were washed in three wash cycles to maximizethe wash effects.

The cellulase of the invention was tested and compared with thecommercial endo-glucanase Endolase© (GH7) and the xylo-glucanaseWhitezyme ©(GH44) both available at Novozymes NS Denmark.

The test results indicated that Endolase showed the strongest ability toprevent carbon black from depositing on the textile. Whitezyme and T.lapidaria cellulase showed similar response (see FIG. 3).

The invention described and claimed herein is not to be limited in scopeby the specific aspects herein disclosed, since these aspects areintended as illustrations of several aspects of the invention. Anyequivalent aspects are intended to be within the scope of thisinvention. Indeed, various modifications of the invention in addition tothose shown and described herein will become apparent to those skilledin the art from the foregoing description. Such modifications are alsointended to fall within the scope of the appended claims. In the case ofconflict, the present disclosure including definitions will control.

1-15. (canceled)
 16. An isolated polypeptide having cellulase activity,selected from the group consisting of: (a) a polypeptide having at least60% sequence identity to the mature polypeptide of SEQ ID NO: 2; (b) apolypeptide encoded by a polynucleotide that hybridizes under lowstringency conditions with the mature polypeptide coding sequence of SEQID NO: 1 or the full-length complement thereof; (c) a polypeptideencoded by a polynucleotide having at least 60% sequence identity to themature polypeptide coding sequence of SEQ ID NO: 1 or the genomic DNAsequence thereof; or (d) a variant of the mature polypeptide of SEQ IDNO: 2 comprising a substitution, deletion, and/or insertion at one ormore positions; and (e) a fragment of the polypeptide of (a), (b) or (c)that has cellulase activity.
 17. The polypeptide of claim 16, having atleast 70% sequence identity to the mature polypeptide of SEQ ID NO: 2.18. The polypeptide of claim 16, having at least 80% sequence identityto the mature polypeptide of SEQ ID NO:
 2. 19. The polypeptide of claim16, having at least 90% sequence identity to the mature polypeptide ofSEQ ID NO:
 2. 20. The polypeptide of claim 16, having at least 95%sequence identity to the mature polypeptide of SEQ ID NO:
 2. 21. Thepolypeptide of claim 16, which is encoded by a polynucleotide thathybridizes under high stringency conditions with (i) the maturepolypeptide coding sequence of SEQ ID NO:1, (ii) the genomic DNAsequence thereof, or (iii) the full-length complement of (i) or (ii).22. The polypeptide of claim 16, which is encoded by a polynucleotidehaving at least 60% sequence identity to the mature polypeptide codingsequence of SEQ ID NO: 1 or the genomic DNA sequence thereof.
 23. Thepolypeptide of claim 16, comprising or consisting of SEQ ID NO:
 2. 24.The polypeptide of claim 16, comprising or consisting of the maturepolypeptide of SEQ ID NO:
 2. 25. The polypeptide of claim 24, whereinthe mature polypeptide is amino acids 1 to 428 of SEQ ID NO:
 2. 26. Thepolypeptide of claim 16, which is a fragment of SEQ ID NO: 2, whereinthe fragment has cellulase activity.
 27. A detergent compositioncomprising a polypeptide of claim 16 and a surfactant.
 28. A nucleicacid construct or expression vector comprising a polynucleotide encodinga polypeptide of claim 16, wherein the polynucleotide is operably linkedto one or more control sequences that direct the production of thepolypeptide in an expression host.
 29. A recombinant host cellcomprising a polynucleotide encoding a polypeptide of claim 16, whereinthe polynucleotide is operably linked to one or more control sequencesthat direct the production of the polypeptide.
 30. A method of producinga polypeptide having cellulase activity, comprising: (a) cultivating thehost cell of claim 29 under conditions conducive for production of thepolypeptide; and (b) recovering the polypeptide.